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Abstract 



The first goal of this paper is to show that discreteness, locality, and relativistic 
covariance can peacefully coexist if the ordinary spacetime (OST) is replaced with 
phase spacetime (PST) as a geometric background of a Poisson process, where PST is 
O^, a spacetime generalization of a notion of phase space (this is a 7-dimensional version 

! of the 8-dimensional structure proposed by Caianiello). Furthermore, Caianiello's idea 

of finite acceleration is implemented. After this is done, the paper then goes on to 
generalize the geometric notions obtained from the intuition of PST to a general discrete 
^ I causal set, without any geometric background. It then takes advantage of the absence 

QQ ' of lightcone singularity to attempt to tackle the definition of PST- like causal set; that 

0^ ■ is, a discrete system that approximates the geometrical properties that we would expect 

I from continuum PST. Finally, the paper proceeds to introduce quantum field theory on 

a causal set, and shows that the locality gained by switching from OST to PST brings 
us one step closer to be able to treat quantum field theory on a causal set analytically 
, rather than numerically. 

> 

X. 

^ , It is well known that several approaches to quantum gravity rely on discretization of 

spacetime. If one is to perform that discretization in a relativistically covariant fashion, one 
would replace a continuous spacetime with Poisson distribution of points on a continuum 
background. However, as was discussed, for example, in [7], there are non-locality issues 
related to Minkowskian nature of space-time. In particular, it can be easily shown that the 
volume of a set of points whose Lorentzian distance to a given point p is smaller than e is 
infinite. Since in discrete scenario the distance is interpreted as a number of causal links, 
this implies that any given point has infinitely many other points to which it is connected 
by a single causal link. 

Furthermore, at least in a flat Minkowski space, for any given point p, and any reference 
frame, if we are to select a random point q whose distance to point p is less than e then the 
difference in coordinates of these two points is arbitrarily large, with absolute certainty. This 
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implies that if we are to introduce fields, such as scalar field, its variation between these two 
points might be arbitrarily large as well. Likewise, if we are to look at a curvature to analyze 
gravitational field, this can also vary by a very large amount between these two points. In 
continuum scenario this is not an issue because in order to take derivative of the field we are 
only concerned about the limit. Thus, the fact that e is finite, however small it might be, 
makes the above issue irrelevant. In discrete case, however, we have no choice but to define 
"derivatives" in terms of finite distances, which makes the issue crucial. 

Let us now analyze the root of the problem, in order to see how to address it. Our 
inability to distinguish different points q that have the same Lorentzian distance, e, to point 
p is related to the fact that one can perform Lorentz transformation that takes qi to 
while keeping p fixed (here, the spacetime is assumed to be flat). Thus, if originally qi was 
on t-axis, while q2 was near the light cone, and if our field distribution was "slowly varying" 
in the pqi frame, it would be seen as fast varying in pq2 frame. 

This can also be explained as follows. In case of Newtonian physics, if we take two 
arbitrary frames, they will be moving with arbitrary high velocity relative to each other. 
The analogy of this statement for relativistic case is that two arbitrarily chosen reference 
frames move arbitrary close to the speed of light relative to each other. Thus, if we take 
two arbitrarily moving trains, due to Lorentz contraction, the length of each train will be 
arbitrary small in a reference frame of the other one. If a train is represented as a field, then 
each train will see another train as a (5-function. The above Lorentz contraction argument 
continues to hold if a train was replaced with some smooth field distribution, which is the 
ultimate cause of the problem. 

Thus, the root of the problem is that the velocity of the trains relative to each other 
is arbitrary. This implies that the issue can be solved if we forbid the interaction between 
physical systems moving with very different velocities. This can be easily done by encorpo- 
ration the idea of Caianiello (see, for example, [16]) of adding the "velocity coordinates" to 
the spacial ones. In other words, we are "taking seriously" the concept of phase space and 
combining it with the concept of space time, to obtain the concept of "phase spacetime" 
(PST) as a basic background geometry. If the ordinary space time (OST) is represented by 
a d-dimensional manifold (which, for convenience, can also be denoted by M.ost = -M.) 
PST is represented 2d — 1 dimensional manifold, MpsT- This manifold consists of ordered 
pairs of the form (x, f), where x G J^ost and t> is a timelike tangent vector to AiosT at 
a point X, satisfying g^yV'^v^ = 1 (here, and throughout the paper, (+1, —1, —1) metric 
convention is used). 

Just like two points that are "far away" in OST can't directly interact, neither can 
points that are "far away" in PST. In both cases, the interaction occurs through other points 
"in between", however fast. The only difference is that in terms of a projection onto OST 
coordinates, the distance between the points that can directly interact is very small, although 
it is still finite as a result of discretization that we postulated. On the other hand, the scale 
of direct interaction in the projection onto PST/OST might be either small, or large. It 
will be shown in section 4.4 that it is reasonable to assume that the scale corresponding to 
kinetic term of the Lagrangian is small, while the one corresponding to the interaction term 
is large. However, in both cases the scale is still finite. The finiteness of the interaction scale 
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on PST/OST, however large it might be, allows us to enforce as small OST scale as we like. 

A field is no longer a function of x'^ alone, but rather it is a function of both and 
v'^. In fact, if and are different vectors, there is no reason to expect (j){x'^,v^^) and 
(j){x'^,w'^) to be similar to each other, just like in case of OST we do not expect the fields 
at two different points that happened to have the same value of projection on one of the 
axes to be similar, either. Thus, it no longer makes sense to speak of (f){x), as the value 
can be anything, depending on the choice of v, which, also, can be anything. In section 4.4 
we will find out, however, that if |f — < A, then interaction terms in the Lagrangian 
which correspond to, for example, 0^ in ordinary quantum field theory might couple v) 
and (l){x,w), where A is a large parameter corresponding to ultraviolet cutoff. However, in 
light of the small coupling constant, the values of a field at these two points are still fairly 
independent of each other. 

In such scenario, the non-locality problem can be addressed if we define continuity of 
to be a statement that 4>{x,v) ~ (f){x',v') whenever {x,v) ~ {x',v'), if {x,v) ~ {x',v') is 
defined as follows: 

1) X and x' are connected by unique geodesic, 7(0;, x') 

2) If v" is a parallel transport of v' from x' to x along 7(0;, x'), then Qf^uiv"'^ — v^){v"'^ — 
w^) ^ 

3) In local geodesic coordinates around x chosen in such a way that f° = 1, and = 0, 
one has x''^ ~ for any choice of /i. 

It is important to notice that the topology on the PST is not the product topology. To 
illustrate a point consider a fiat spacetime. If (x'^ — x^){x'^ — s^) << 1, but v^{x'^ — x^) = 1 
(and V = v'), then {x',v) is close to {x,v) according to product topology, but not according 
to topology presented here. Since the product topology, by definition, does not use any 
information about v in comparing x and x', one has no choice but to use Lorentzian distance 
to do the latter, which leads us right back to the infinities we were trying to avoid. On the 
other hand, the topology presented here makes explicit use of either v, or v', or both, in 
order to select a reference frame in which to compare x and x'. This distinguishes it from 
product topology, and also allows us to make sure that neighborhood of each point is finite; 
and in fact, can be made arbitrary small, as expected. 

In order to illustrate why the above does not violate Lorentz symmetry, lets go back 
to the case of OST. The only reason the notion of "Lagrangian density at a point" does 
not violate translational symmetry is that action is defined to be the integral of Lagrangian 
density over all possible points of the spacetime. Likewise, in PST case, if we were only 
interested in evaluating Lagrangian density at {xo,vo), this would, of course, violate Lorentz 
symmetry along with translational one. But the fact that we integrate over all possible [x, v) 
to get the action is the reason that neither translational symmetry nor Lorentzian one were 
violated. 

Towards the end of the paper it will also be shown that this approach also makes it 
more realistic to be able to do analytic computations on a causal set, rather than numeric 
ones. To illustrate this, consider the causal set Lagrangians defined based on OST, in [3] 
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and [3]. In the latter references, a neighborhood of any given point was defined in terms 
of Lorentzian distance, and thus was infinite. Thus, in order to define Lagrangian density, 
a sub-neighborhood of that neighborhood was selected. Each such sub-neighborhood, of 
course, had a corresponding "preferred" reference frame. 

In order for the choice of such frame not to violate relativity, the frame was defined as a 
function of the behavior of relevant fields. In particular, the frame was selected in a way that 
would minimize the observed fluctuations of fields (this, of course, was done in order to get 
rid of 5-functions discussed earlier). Now, the procedure of selecting reference frame based 
on specific behavior of the field is, of course, very non-linear, and can not be represented in 
terms of perturbations to linearity, either. This is the ultimate reason why this approach 
does not allow one to go from Lagrangians to propagators, and ultimately forces one to use 
numeric methods. 

On the other hand, according to the approach proposed in this paper, the quantum 
fields are functions on a PST, as opposed to OST. Since the neighborhood on PST is already 
finite, one does not have to select a sub-neighborhood. Since the latter is the only source 
of non-linearity, this modification opens the door for us to use ordinary perturbation theory 
techniques that are used in quantum field theory, and thus produce analytic results. 

2. PST view of general causal set 

2.1 OST view of a causal set: a review 

Even though the approach proposed on this paper replaces the OST with PST, it still 
borrows other aspects of the philosophy of causal set theory proposed by Rafael Sorkin. For 
that reason, it will be fruitful to review the original, OST-based, concept of a causal set, and 
then the PST concept will be introduced on the next section. 

Suppose we have an OST, represented as a manifold = AiosT, with a metric g^u- 
Then the causal structure on can be viewed as partial ordering, -<M,g- For any p and 
?! V ^M,g Q if and only if p and q are connected by at least one timelike path 7 (timelike 
path is also sometimes referred to as causal path). In other words, 7(ti) = p, 7(r2) = q and 
Qnudl^dl'^ > 0. This partial ordering is referred to as causal structure. 

Malement and Hawking have shown that there is a bijection between g^u/\9\ and -<M,g- 
Furthermore, in case of Poisson scattering of points, the uniform density of the scattering 
can be used to approximate \g\. This would imply that -<M,g approximates gf^u itself. At the 
same time, on a microscopic scale, uncertainty principle implies that quantum fluctuations 
of gravitational field break down manifold structure, so gf^i, can no longer be used. A funda- 
mental assumption of causal set theory is that partial order persists even on that scale, thus 
the latter is viewed as more fundamental than the metric, while metric is only an emergent 
manifestation of that partial order on a larger scale. 

In light of this, geometrical quantities have to be defined on some more general (locally) 
finite set S, with a partial ordering -<s- For example, for any pair of points p -<s q, the 
distance between p and q, Ts^^g{p,q) is defined as the product of some very small real 
number, tq, with largest possible integer n for which one can find a sequence p -<s f^i ~<s 
■ ■■ -<s fn-i -<s Q- The number tq is interpreted as the length of a single link, or atomic scale. 
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It is introduced mainly in order to be able to carry out approximations in chapters 3 and 4, 
which depend on e6 « 6, where the distance scales defined by e and S are very small, but 
still large enough to include chains of large numbers of elements. This requires the length of 
each link to be much less than 1, which is the purpose of the constant tq. 

Now suppose that there is some random mapping / : S ^ Ai. A pullback causal structure 
on 5* (see [H]), which is denoted by -</,g, is defined as follows: p -<f^g q if and only if 
f{p) ~^M,g /(?)• It can be easily shown that in a local region of a manifold, where the 
geometry can be assumed to be locally fiat and any pair of points is connected by unique 
geodesic, if -<s=-<f^g then rA^,g(/(p), /(g)) ~ fcd^5,^s(p,g) (see, for example, [I3]). Here, 
'^M,g{.f ip) 1 f {l)) stands for the length of the longest possible geodesic 7 in connecting 
fijp) and /(g), the metric ^f^y is used to measure that length, and kd is some coefficient that 
depends on dimension, ci, on a manifold. 

While this point is controversial, this paper takes the point of view that Ts{p., q) is exact 
rather than an approximation. After all, it is illogical to claim that a definition is not exact, 
since in this case one would need another, presumably exact, definition to compare it to. 
The latter would require a unique way of defining M. into which 5* is embedded, which would 
be a source of, presumably exact, quantity we are interested in. Even if this was possible, 
doing that would totally contradict the philosophy of causal set theory. In light of the fact 
that discrete causal structure is viewed as fundamental, while continuum is only something 
that is emergent on a large scale, the latter is an approximation for the former. This can 
be contrasted with the fact that in the case of Poisson distribution of points the discrete is 
an approximation for the continuous. This reinforces the idea of taking what used to be an 
approximation in continuum case, and make it exact for a causal set. 

In order to understand the way in which the definition and approximation reverse each 
other, a good example to think about is definition of trigonometric functions on a ring where 
the notion of limit, and therefore infinite series, is not defined. According to the philosophy of 
this paper, tangent of an element of a ring can be defined as a Taylor series up to some fixed 
constant, say 100 (we will assume that multiplications and divisions by fixed real numbers 
are well defined). The expansion up to 100 is, by definition, exact, and it continues to be 
exact even on the vicinity of 7r/2, including the point 7r/2 itself! On the other hand, the 
infinite Taylor series is no longer exact, but only an approximation; that approximation 
breaks down in vicinity of 7r/2. 

2.2 Basic principles of PST discretization 

We would now like to rewrite the philosophy of the previous section, replacing the 
Poisson distribution in OST, with the one in PST. First, consider an xy plane, where x and 
y are OST coordinates. A random scattering of points on xy plane can not be produced by 
first taking a random scattering on x axis, and then, for each of the scattered values of x take 
random scattering on y axis. Likewise, in our case, since x^ and are separate dimensions, 
it is not correct to construct discretized PST by taking a union of random sprinklings on 
the tangent planes at randomly sprinkled points in OST. It is likewise not correct to view 
PST as a set of pairs of elements of OST since that, too, would result in PST fiuctuations 
being the function of OST ones. Instead, we view PST as a single continuum manifold that 
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was given from the start, and consider a single Poisson process involving scattering of points 
on PST itself. This means that if we wish to view the points in PST as vectors in OST, 
the probability of finding two vectors coinciding at a point is 0. Besides, even if such pair 
of vectors did exist, it would be irrelevant, just like it is irrelevant if the x coordinate of two 
scattered points in xy plane happens to coincide. 

This raises a question: an element of PST is defined to be a vector in OST. In light of 
this, in order to define PST without first defining OST, we have to define a vector directly, 
without resorting to identifying it with a pair of points in OST. Of course, we would be able 
to do that if we are to assume the existence of a continuum geometrical background. But 
how can we rely on its existence, if that is something we ultimately intend to get rid of? We 
address this by recalling that in case of OST-based causal set theory, the definition of causal 
relations was also motivated by continuum geometry: two points are causally related if and 
only if one can go from p to q without going faster than the speed of light; the word "speed" 
requires the presence of geometry as well as metric in order to be well defined. So, in order 
to see whether our present geometric construction is "legal" or not, we have to outline what 
was done in case of causal set theory in OST, and see if we can do parallel steps for the case 
of a PST. For the discretization of an OST we did the following: 

1) For a manifold A4, with metric g^^, define a partial order -<M,g- This requires the 
use of continuum geometry and metric. 

2) Define the notion of distances on M., that is, TM,g{p, q)- Again, this requires contin- 
uum geometry and metric. 

3) For arbitrary causal set S with causal relation -<s, define Ts,-is{p,q). Note: S has 
nothing to do with Al, and, technically, ts has nothing to do with TM,g either. Thus, no 
cheating was done. 

4) For any function f : S ^ M. (which is interpreted as Poisson scattering of points on 
and any metric g^y on Al, define a partial ordering -<f^g on S as follows: P ~<f,g q if and 

only if /(p) -<M,g f{q)- 

5) Verify that, if ^5=^ f,g then, with high enough probability, TM,g{f{p), f{q)) kdTs{p, q) 
for some dimension-dependent parameter k^. This step is what we keep in mind while doing 
parts 3 and 4. However, since we didn't formally say it, no cheating has occurred. 

Now, in light of the above scheme, geometry can be used to do parts 1 and 2. As long as 
one is allowed to use continuum geometry, one is allowed to use as much of it as necessary, 
including postulating tangent vectors to the manifold. Now, if we denote continuum OST 
manifold by A4ost — -M. and corresponding continuum PST one by A4psT, then the above 5 
parts can be repeated, replacing Most by AdpsT (and likewise replacing f : S ^ -Most with 
/: 5 — )■ Aipsr)- The fact that continuum geometry was used in defining Aipsr based on 
M. has no impact on that statement. In fact, as you will shortly see, in order to define causal 
relations in PST even more advanced smooth geometry will be used, including geodesies, as 
well as their relative velocity at junction points. All of this will be "legal" because it will be 
understood that whenever such constructions are done, we are dealing with parts 1 and 2. 
Despite all this, the end result will not be continuum-based, thanks to parts 3 and 4. 
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Furthermore, despite the fact that in parts 1 and 2 the Most was used to construct, 
and define geometry on, Aipsr, we still view Aipsr as fundamental. The reason for this 
is that parts 1 and 2 are not an official part of the definition of a causal set; they are only 
a motivation. The verification of part 3 with parts 4 and 5 is independent of the way in 
which the continuum was obtained; in other words, it is independent of parts 1 and 2. On 
the other hand, in light of differences in commutation properties of OST versus PST/OST, a 
special, non-trivial, construction will be invented in order to define OST as a subset of PST 
(which is to be contrasted with the way we started from OST to obtain PST in continuum 
case). This will be done through defining different geometrical quantities, such as "relative 
velocity" and "displacement" on PST causal set. Thus, two elements of PST causal set 
with non-zero relative velocity but zero displacement will be interpreted as two OST vectors 
at the same (OST) point. However, both "relative velocity" and "displacement" will be 
viewed as distance-like parameters, albeit with different properties. Furthermore, it will be 
understood that the discretized version of their commutation properties only applies to a 
small class of causal sets, that is referred to as PST-like. This is similar to the situation 
in causal set theory based on OST where manifold properties, such as approximate local 
validity of Pythagorean theorem, only apply to a small subset of causal sets, referred to as 
manifold-like (or, in the language of this paper, OST-like), even though the entire theory 
(including definition of length) is officially applied to all causal sets, while its secret aim is 
to get expected results on that small class. 

Nevertheless, we are not completely done with parts 1 and 2, so in much of the ramainder 
of the paper, we will use these parts in order to define the above parameters. Thus, while 
officially we will view OST as a subset of PST, we will secretly continue to view PST as an 
extension of OST while motivating our definitions. 

2.3 Causal structure on PST 

Let us now move to the actual definition of causal relation on scattering of points on a 
PST. As was stressed in the introduction, our theory is based on the assertion that there is a 
fixed scale in which direct interactions are allowed. That scale is small in its projection onto 
OST, but it can be either small or large in terms of its projection on PST/OST. In order 
to be able to accomodate both, the geometry itself should have small scale, both on OST 
and on PST/OST. Moreover, since, as stated earlier, we do not separate these two sets, and 
scattering of points only respects the structure of entire PST, we can simply say that the 
geometry has to be local, up to very small scale, on PST. 

In light of the philosophy of OST causal set theory, the geometry can be reproduced 
through causal relations, and causal relations are possible trajectories of photons. If we adopt 
that philosophy into PST causal set theory, we would conclude that we need photons to have 
velocity f < c in order to "pass" by an element (x, v). Furthermore, they have to have finite 
acceleration, in order to be able to "probe" the small PST/OST scale (this, again, was 
originally proposed by Caianiello for continuum case - see [IS]) . According to this model, 
the photons are undergoing the random walk in PST. In Newtonian case, the random walk in 
phase space amounts to velocity being random; and, most "random" velocities are arbitrary 
large. In relativistic case, the statement that most velocities are arbitrary large translates 
into a statement that most velocities are arbitrary close to c = 1, which is a consequence of 
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non-compactness of Lorentz group. By making the time scale of the random walk very small 
(but finite), we would see that after small, but finite, period of time the photon will almost 
reach a velocity c = 1, as desired. 

As an important aside, we have to note that, as was mentioned in the introduction, and 
expended upon in section 4.4, the so-called "interaction" terms in the Lagrangian couple 
(f){x,v) and (f){x,w), where — < A, and A is very large. Thus, the "interactions" 
that is used to probe small scales in PST are limitted to kinetic terms in the Lagrangian. 
The random walk in PST is something that photon does by itself, without any interaction 
with matter fields. Depending upon the philosophy of a reader, one might say that while 
photon's interaction with matter involves large momentum exchange, its interaction with 
vacuum involves a very small one. This, of course, hints at the idea of vacuum energy. 

Let us now go back to the mathematical description of random walk in PST. We start 
out at {xi, vi). Then, we make a "jump" in velocity, to {xi, v'() (the reason v" is used instead 
of v'l will be clear shortly). Then, we make a "jump" in position by traveling to {x2,V2), 
where {xi,v1) and (x2, ^2) are tangents to the same OST geodesic 71. Then, again, we make 
a "jump" in velocity by going from (2:2, ^2) to {x2,V2). Then, again, we make a "jump" in 
position by traveling along OST geodesic 72 from (0:2,^2) to {x^^v'^), and we keep going in 
the similar fashion. Thus, we generate a sequence of points {xk,Vk) and OST geodesies 
(or, in Lorentz notation, points (x^, v^) and OST geodesies 7^), where k goes from 1 through 
n. These OST geodesies are viewed to be differentiable functions 7^ : R — >■ A4osTj where A4 
is OST manifold, satisfying 

such that the change of velocity upon passing each point is bounded by some parameter. 
Now, we notice that, in light of the fact that each 7^. is parametrized from to r^, we have 
IkiTk) — 7fe+i(0). Thus, the constraint on the change of velocity can be written as 

d7^ dYk+, 

where the norm 1 is assumed: 



- - 1 (3) 

Now, if left as is, this would still have a lightcone problem, with c-lightcone being replaced 
with i>- lightcone. This problem can be avoided by imposing additional constraint, that states 
that Lorentzian distance between Xk and x^+i is smaller than 5. In combination with norm 
1 condition above, the distance constraint can be expressed as 

Tk < S (4) 

Since Lorentzian distance is c-based rather than f-based, most of f-based "light cone" is far 
away in Lorentzian sense. Equivalently, the criteria of two vectors being next to each other 
in a chain combines the I'-lightcone with c-lightcone. It is easy to see that the combination 
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of these two lightcones uniquely defines "preferred frame" (which points in a direction of the 
OST vector, corresponding to PST element of interest), and thus one has means of defining 
local region in that frame. 

We are now ready to define a causal structure on the scattering of points in PST. Two 
points {y, u) and (2;, w) are causally related if there is at least one integer n G N for which the 
above chain can be constructed, with {y,u) = {xi,Vi) and {z,w) = {xn,Vn) (this ammounts 
to replacing -<M,g with -<Mj,g since now it is a function of the scattering f : S ^ M., but 
this does not affect further steps, nor the conclusion of that argument). Thus, while the 
Lorentzian distance between two neighboring elements of a chain is bounded above, the 
distance between two arbitrary causally related elements is not. Incidentally, this borrows 
some aspects of graph theory advanced by Krugly, in that the original links (which are viewed 
to be edges of a graph) are non-transitive, while the (transitive) causal structure is defined 
by chains constructed out of these links. However, taking advantage of this extra structure 
is beyond the scope of this paper. 

The above discussion sums up in the following definition of a puUback causal structure 
-<f^g for scattering f : S ^ M.psT and the metric g^u on M.: 

Definition: Let be a set and let /: ^ M.psT be a random scattering on M.psT- 
Then the pullback causal structure on S corresponding to /, which is denoted as is 
defined as follows: if p and q are elements of S, then p -<f q ii and only if one can find 
a sequence of other elements of S, ri,...,r„_i, and a sequence of differentiable functions 
71, 7„: R — >■ which satisfy the following conditions: 

^) dr dr - ^ 

hi ^ - 

^) dT^ PO- dT dT 

c) 7fc(0) = < and 7;^(Tfe) = x^^^, where Tk < 5 

dj 9tiu-d;f\T=Tk—d;r\'r=^ < i + e 

e) m = (71(0), ^|.=o) ; m = (7n(r„), ^|.=.„) 

2.4 Definition of Geodesic 

NOTE: from here on, the expression vector means the same thing as "element of PST". 
This is similar to the expression "vector on a vector space". 

In the remainder of this chapter we will attempt to define parallel transports, rotations, 
distances and angles, which would give us all of the necessary geometry to be able to define 
PST geometry in chapter 3 and quantum field theory in Chapter 4. 

Most of the constructions that will be used rely on the notion of geodesies in discrete, 
PST-based causal set. In light of the fact that /-image of each element of S (which, sloppily, 
will be identified with clement itself from now on) can be viewed as a vector in OST, we 
would expect that each p E S determines unique geodesic passing through p. We have 
previously mentioned that causal structure on PST is motivated by a toy model of classical 
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photons undergoing bounded-above changes in velocity in random walk fashion. In this case, 
the geodesic can be viewed as a "more likely" path of such photons. 

However, while geodesic might be more likely than other paths, it is certainly not likely 
that the photon will follow that geodesic, or any other well defined path for that matter In 
fact, the application of random walk to flat spacetime tells us that the photon will move 
arbitrary close to c = 1 with respect to any given reference frame, after enough time has 
passed, while geodesies have an arbitrary constant velocity v < c. Thus, we would hke to 
say that geodesic is "the least un- likely path, while still very un-likely". More precisely, if 
an element g is a distance Tq away from the element q on the geodesic passing through p, 
the probability of actually reaching q after proper time Tq is very small. Nevertheless, that 
same probability is much larger than the probability of reaching any other fixed element, r, 
after that same time r. 

The discrete version of the path of fixed proper length is a sequence of points p ~<* ri ~<* 
... -<* r„_i -<* q with fixed n. Here, -<* is a direct link; that is, a ^* 6 if and only if a ^ 6, 
and there is no c satisfying a < c <h. Thus, q is part of geodesic passing through p if there is 
at least one integer n for which q is connected to p by more of the above sequences than any 
other PST element. However, in light of discreteness, there can be more than one element q 
that meets that description. Since we don't have any other means of differentiating between 
them, we will consider a set of all such elements, for any fixed n. But, in the special case 
of an approximation to the continuum (which is what we are aiming for, in light of parts 
1 and 2), these elements are so "close" to each other that they "look" like a single element 
(in PST terminology we have several points so close to each other that they "blur" into a 
single point; in OST terminology, we have few vector that are "very close" to each other 
and point in "very similar" direction that they "blur" into a single vector). So, even though, 
strictly speakng, it is a set, it "looks like" a translation of a single point p by a fixed distance 
along geodeics. In light of this intuition, such subset of S will be referred to as either past 
geodesic translation or future geodesic translation of an element p, depending on whether 
it is before or after p. It is strictly a translation along the direction of vector itself, and 
is not to be confused with a more general parallel translation, that simply happened to be 
time-like. This brings us to the following definition: 

Definition: Let p be an element of -S" and let n > be positive integer. Then future 
geodesic translation of p by n is a set Gn{p) C S consisting of elements q with largest possible 
number choices of sequences p -<* ri -<* ... -<* r.„_i -<* q. Furthermore, the past geodesic 
translation of p by n is a set G^nip) C 5', consisting of elements q' with largest possible 
number of choices of sequences q' -<* si -<* ... -<* -<* p. Finally, Gq{p) is a one element 
set, containing p as its only element: Gq{p) — {p}. 

Thus, we note that Gm{p) is a set of elements that have coordinate t = m in the reference 
frame defined by p. However, there are a number of features that one would not expect from 
ordinary geometry: 

1) The set Gm{p) can contain more than one element, as stated earlier. 

2) It is possible that the same element q is an element of both Gm{p) and Gn{p) for 
different m and n, as long as they are both non-zero and have the same sign. 
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3) If m < n, qi E Gmip), and q2 G Gn{p), this does NOT mean qi -< q2j unless either m 
and n have opposite signs, or both of them are equal to 0. In fact, it is even possible that 

q2 -< qi- 

Now, a geodesic passing through p, which is also referred to as trajectory of p, is simply 
a union of all possible future and past translations of p: 

Definition: Suppose p is an clement of S. Then the trajectory of p, or, equivalently, a 
geodesic passing through p, is G{p) C S defined as 

Gip) = U (5) 

The past and future trajectories of p are G-{p) and respectively, and are defined as 

= G nJ-(p) ; = G n r{p) (6) 

where 

J-(p) = {qe G{p)\q -< p} ■ J+{p) = {qe G{p)\q y p} (7) 

In light of the fact that Go(p) is one-element set while Gm{p) is not, the geodesic has some 
"thickness" and it gets "thin" at p. This implies that, even if q is an element of G{p), G{p) 
and G{q) do not coincide. 

2.5 OST coordinates relative to an element of PST 

Since every PST element p, if viewed as an OST vector, determines a reference frame, it 
should be possible to separately define space and time displacements of some other element, 
q, relative to p. If photons were moving with speed c, we would imagine a photon being 
emitted at some element r G Gm{p), reaching q and then after being re-emitted from q, being 
absorbed at s G Gn{p)- Then, assuming c — 1, the time and space displacements of q are 
(n -|- m)/2 and (n — m)/2, respectively. 

I claim that we can continue to do the same thing in our case, despite the fact that 
speed of light is not constant. After all, if the density of the scattering of elements on a PST 
is high enough, the photons will reach near-lightlike velocity within a very short time, with 
probability close to 1. This means that the definition of the space and time displacements 
will approximate what we are used to on the scale much larger than the atomic time interval. 

Furthermore, as was discussed with the example of trigonometric functions, there is no 
such thing as a definition not being exact. Thus, by definition, the proposed notions of space 
and time displacements are exact, on all scales. On the other hand, since velocity was never 
defined to be a time derivative of position, but rather it was viewed as a separate coordinate, 
the latter can, in fact, break down. Thus, on the small scale we have 

dr , 

= ! (8) 

The above does not refer to any effects of projections. In fact, even in 1 -|- 1 dimensional 
case, what we used to view as a definition of velocity breaks down on a small scale. This is 
strictly a small scale effect, specific to this theory. 
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It should also be mentioned that the above construction relies on the fact that the 
element of interest is outside of the trajectory of p. After all, if the trajectory happened 
to have thickness of a single element (although it doesn't have to be) then the only way of 
applying the above construction to the element on the above trajectory would be for that 
element to send a photon to itself. This is not allowed since, conventionally, elements of a 
causal set are not causally related to themselves. For that reason, we have to separately 
define, by hand, that if g e G{p) then its space displacement is 0. 

Now defining the time displacement of q is more complicated since it can be an element 
of both Gmip) and Gn{p) for different m and n. We would like to choose a definition of time 
displacement in a way that is the most consistent with the way we treat elements outside of 
G{p) . It is easy to see that this can be done by defining time displacement to be an average 
of the largest and smallest possible m-s satisfying q e Gm{p)- 

Finally, if the space time curvature is large enough, it is possible that G{p) " accelerates 
away" from some elements s outside of G{p) so fast that the imaginary photon emitted from 
s would never "catch up". In such situation, we will formally define the spacial distance 
as +00 and the time separation is +00 if the lightcone of s intersects G-{p), —00 if the 
lightcone of s intersects and ioo if neither intersections occur. 

Of course, it is understood that the apparent infinities are most likely the consequences of 
curvature rather than the actual infinite distances. But, as was discussed in the trigonometry 
analogy, every definition is automatically true. Thus, the above definition is "legal" as long 
as used consistently. Now, in light of the fact that both intended geometry and intended 
physics arc local, the distances between far away objects arc not important, which is why 
the above definition is acceptable. The only purpose of infinity is to formally rule out these 
elements when it comes to defining a "neighborhood" based on the distance or time being 
less than some constant 5. 

The above discussion can be summarized in the following definition: 

Definition: Let p and q be elements of S and suppose q is not part of trajectory of p. 
Also suppose that J~{q) (^G{p) and J^{q) nG(p) are both non-empty. Let m be the largest 
possible integer for which n J^(g) is non-empty. Likewise, let n is the smallest possible 

integer for which Gn{p) n J^{q) is non-empty. Then the time displacement of q relative to p 
is given by 

tp{q) - (9) 
and space displacement of q relative to p is given by 

n + m 

rp{q) = (10) 

On the other hand, if either J~ {q)nG{p), or J^{q)riG{p) is empty, or both, then rp(g) = +00. 
If J~{q) n G{p) is empty while J~^{q) H G{p) is not, then tp{q) = —00. If J~^{q) n G{p) is 
empty while J~{q) n G(p) is not then tp{q) ~ +00. Finally, if both sets are empty then 
tp{q) — ioo. Finally, if g e G{p), then rp{q) — and tp{q) is the average between the largest 
and the smallest n satisfying q e Gn{p)- 
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It is important to note that, as a result of the fact that it is conceivable to have ri -< r2, 
despite ri G Gm{p), G Gn{p) and m > n, one can also obtain negative space displacements. 
In particular, assume that ri -< s -< r2 for some s outside of G{p). Let M be the largest 
possible integer such that at least one element of Gm{p) is to the past of s, and let N be 
the smallest possible integer that at least one element of Gn{p) is to the future of s. Since 
we already know that m and n satisfy these conditions, we know that M > m and N < n. 
This, together with m> n implies that M > N. Thus, rp(s) ^ {N - M)/2 < 0. 

However, in section 2.7, when the notion of PST-like causal set will be defined, a vector- 
valued two point function x: S x S ^ M.'^~^ will be postulated with a requirement that 
q) \ ~ ^ < ^v^^) ^ \^{Pi ?)l + ^- This, together with the fact that |a;| > will imply that, 
while rp{q) can still be negative, it can not be smaller than — e. 

Now that we have the definition of space and time coordinates with respect to a given 
element, let us now go back to the issue mentioned earlier, that the trajectory of the photon 
obeys r — t, despite the fact that its velocity is less than 1. We will state it as a theorem, 
which we are now in a position to prove rigorously: 

Theorem: Let p and q be two elements of a causal set S, and assume that rp{q) is 
finite. These two elements are causally related if and only if \tp{q)\ > \rp{q)\. 

Proof: Since rp{q) is finite, we know that there are only two possibilities: either q e 

G{p), or else there is at least one element of G-{p) that is before q and one element of G+{p) 
that is after q (strictly speaking, the former case is a special case of the latter, but for rigor's 

sake they are treated separately). 

Lets start from the situation where q G G{p). In this case, by definition, = 0. 

Since absolute value can not be negative, this automatically implies that > 0, thus, 

\tp{q)\ > ''"p{q). At the same time, by definition of G{p) there has to be a chain of points 
either of the form p ~< ri ~< ... ~< r„_i -< q, or q ~< ri ~< ... -< r„_i -< p. Either of the two 
implies that p and q are causally related. Thus, in this special case, the equivalence between 
p and q being causally related and > kp(?)| holds true: namely, both statements are 

true. 

Now lets move on to the situation where q is outside the trajectory of p. Let m be the 
greatest integer such that q is after at least one element of Gmip) and let n be the smallest 
integer such that q is before at least one element of Gn{p). As emphasized earlier, we do not 
know if n < m or n > m. Thus, there are 3 possibilities for either m or n: each of them 
can be either negative, zero, or positive. Thus, total there are 3x3 = 9 possibilities. Some 
of them, however, will be shown as impossible. So I will show, case by case, that for every 
single one of these possibilities, one of the three things happens: 

a) |^p(?)| > fp{.Q) and p and q are causally related. 

b) < and p and q are unrelated. 

c) A contradiction is shown. 

Ceise 1: n < and m < 0. Then rp{q) = [n — m)/2 (which can be both positive or 
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negative) and tp{q) — {n + m)/2. Then 



\tp{q)\ = -tp{q) = - ' ' ^' ' (11) 

and 

, , n — m \m\ — \n\ 
rM = = ' \ ' ' (12) 

This imphes that |rp(g)| < |tp(?)|, regardless of how n and m compare. 

Now, the way n is defined tells us that there is s G Gn{p) satisfying s y q. Now, since 
n < 0, s E Gn{p) implies s -< p. This, together with s )^ q implies that q -< p. So in this 
special case both q ^ p and |rp(g)| < |tp(5)| hold. 

Case 2: n > and m > 0. Then r = (n — m)/2 (and again it can be both positive and 
negative) and tp{q) = ^i±Hi. The fact that n and m are both positive immediately tells us 
that \rp{q)\ < \tp{q)\ regardless of how the values of n and m compare to each other. 

Now, the way m is defined tells us that there exists s e Gm{p) satisfying s -< q. But the 
fact that m > implies that s y p. This, together with s ^ q implies that p -< q. Thus, in 
this special case, |rp(g)| < \tp{q)\ and p ^ q both hold. 

Case 3: m < and n > 0. The fact that m < implies that there are no elements of 
Go{p) that are before q. The fact that n > implies that there are no elements of Go{p) 
that are after q. Thus, together, they imply that every single element of G'o(p) is unrelated 
to q. But, by definition, Go(p) — {p}- Thus, p is unrelated to q. 

On the other hand, we know that 

n-m \n\ + \m\ 
^pW = = 2 ^ ' 

and 

, , n + m \n\ — \m\ 
tM = = ' ' ^' ' (14) 

which tells us that rp{q) > tp{q). 

Case 4: n < and m > 0. By definition of n and m, there exists Si G F^(p) and 
■52 ^ Fn{p) such that Si -< q and S2 >- q. Now, the fact that m > implies that Si >- p, and 
the fact that n < implies that S2 -< p. Thus, we have p-<si-<q-<S2-<p which implies 
p -< p which is a contradiction. 

Case 5: m = and n > 0. The fact that m = means that q is after at least one 
element of Go{p). But, by definition, the only element of Go{p) is p itself. Thus, q y p. Also, 
we know that rp{q) — ^ — tp{q). Since < includes =, this means that |rp(q')| < \tp{q)\ holds. 

Ceise 6: m < and n — 0. The fact that n — means that q is before at least one 
element of Go{p). But, by definition, the only element of Go(p) is P itself. Thus, q ^ p. At 
the same time, tp{q) — y and rp{q) — — y. Thus, |ip(?)| = kj)(?)|- Since < includes =, 
rp{q) < tp{q) holds. 
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Case 7: m = and n < 0. By definition, there exists r G Gn{p) such that q -< r. But 
the fact that n < imphes that r ~< p. By transitivity, this means q -< p. But the fact that 
m = implies that q is after at least one element of Gq{p). Since the only element of Gq{p) 
is p itself, we have q y p, which contradicts q -< p. 

Case 8: n = and m > 0. By definition, there exists r G Gm{p) such that q >- r. But 
the fact that m > implies that r )^ p. Thus, by transitivity, q y p. But the fact that n = 
imphes that q is before at least one element of Gq{p). Since the only element of G'o(p) is p 
itself, we have q ~< p, which contradicts q)^ p- 

Ceise 9: m — n — {). By definition, there exists r G Gq{p) and s G Go(p) such that 
r -< q -< s. But the only element of Go(p) is P itself, which means p < q < p which is a 
contradiction. QED. 

2.6 PST/OST coordinates relative to a PST element 

We have thus finished outlining the basic properties of r and t. However, remembering 
that we are dealing with PST rather than OST, we are not done yet. Our next step is to 
define velocities. This, however, is easy to do: if we would like to measure velocity of an 
element q relative to an element p, we just have to analyze the behavior of rp(s) verses tp(s) 
for a fiowing element s G G{q). 

Now, if tp{q) and rp{q) are small compared to the scale of the curvature, we can select 
large enough m and n so that, if one "looks" at the picture from Gmiq) or Gn{q) one would 
see p and q being very close to each other. Thus, the angular velocity of q relative to p is 
approximately everywhere except a very small segment of the trajectory that happens to 
be close to p. This allows us to estimate the absolute value of the velocity of q relative to p 
without worrying about its angular velocity: 

Definition: Let p and q be elements of S and let n > be a positive integer. Then 
n-th velocity of q relative to p is given by 

mGn[q) 

where stands for number of elements in a set. 

The above definition violates time-reversal symmetry since n, which is assumed to be 
fundamental constant of nature, is either positive or negative. This is not a big deal since if we 
insist on having time reversal symmetry, we can always replace Vp-n with (vp;„(q')-|-i>p;_„(g))/2. 
The only reason it wasn't done that way is simply that if it is not done now it can always 
be done later, but if it is done now, it would not be possible to undo it, since some of the 
information would be lost. 

It has to be noted that no assumption of a presence of geometry or smooth behavior is 
made. Thus, it is conceivable that Vp-n+i = 10^ x Vp-n- In this case, still, by definition, the 
velocity is exactly Vp-n- In fact, even if we had a situation where Vp-n — 1 and Vp-k = 10^ for 
all k ^ n, the velocity will still be 1, by definition. Again, the trigonometric analogy is what 
motivates this assertion. 



15 



It is possible to avoid having to take relatively large distance in definition of velocity, by 
replacing velocity with Lorentz factor 7 that, as usual, represents time dilation. The reason 
V was chosen was that the goal of this construction is to make things look as intuitive as 
possible, while the relation between 7 and v requires a little bit of proof. Nevertheless, since 
this sense of simplicity is subjective, one might argue in favor of 7 since that would not 
require a time displacement by a relatively large distance. So for that purpose, definition of 
7 will be included, and then it will depend on the purpose or philosophy of further work to 
choose between these two definitions: 

Definition: Let p and q be elements of S and let n > be a positive integer. Then 
n-th Lorentz factor of q relative to p is given by 

where ji stands for number of elements in a set. 

Now, in order to describe the motion of q relative to p fully, we do need to define its 
angular velocity. In case of usual geometry it can be estimated as a; = ^sin 9, where 9 is an 
angle between radius and the direction of velocity. It can be further seen that sin 9 = h/r 
where h is the distance between p and the closest vector to p on the trajectory of q. Putting 

these together we obtain 

Definition: Let p and q be two elements of S and let n be an integer. Then, n-th 
angular velocity of q relative to p is given by 

^pAq) = ^^2^'^M')\S e G{q)} (17) 



This immediately gives the definition of radial velocity: 

Definition: Let p and q be two elements of S and let n > be positive integer. Then 
the n-th radial velocity of q relative to p is given by 

<n(?) = ^/v'p;niQ)-rl{qH,niQ) (18) 



3. Axioms that make causal set PST-like 

NOTE 1: Throughout this chapter, the distances are going to be scaled by a small 
parameter tq, so that they no longer have to be integers and the smallest distance is much 
less than 1. This is done in order to make sure that, for small distance scales S, we have 
« 5 which would allow us to define approximations. 

NOTE 2: This chapter can be skipped without any effect on understanding of chapter 

4. 

NOTE 3: In hght of discreteness I didn't even attempt to prove that any given set of 
axioms I propose will formally imply a specific geometry. I do have an intuitive sense that 
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these specific axioms do imply that, but it is up to verification by numeric simulations (which 
haven't been done yet) to see to what extend this is the case. 

3.1 Why cixioms? 

Now we are done defining all of the fundamental parameters. It is important to note 

that, even though geometry was used as part of a motivation, no formal assumptions about 
the geometry were made in actual definitions. Thus, in case of an arbitrary causal set, we 
can still formally apply the above definitions to get space and time displacements as well as 
radial and angular velocities of different elements with respect to each other. The definitions 
were designed in such a way that in a very special case of PST-like causal set they happen to 
approximate the notions that we are used to. But since most causal sets are not PST-like, in 
a general case these notions have nothing to do with any geometry; their definition is purely 
algebraic, just like, say, a definition of prime number. 

To emphasize this point, we can take any given partially ordered set and do two separate 
exercises: one exercise involves defining OST-based quantities, and the other one involves 
PST-based ones. Despite the fact that these two sets of definitions are based on completely 
different assumptions of the geometry of light cone (which leads to different commutation 
properties of underlying space), both exercises can still be formally carried out on the same 
partially ordered set; but, of course, it doesn't mean that on a larger scale a given set can 
resemble OST and PST at the same time. In fact, we can, formally, define PST-based 
quantities on discretized OST, and OST-based quantities on discretized PST (I don't mean 
projection on OST, but rather a definitions based on OST interpretation of the shape of the 
light cone, which is why it is physically a wrong thing to do for PST). In both cases the 
procedure can be formally carried out; but the results would completely lack any physical 
meaning. 

Because of the examples just presented, it is important to understand that PST-based 
causal set is not the same as PST-like one (the same is true if we replace PST with OST). 
When we say that a causal set is either PST-based or OST-based, we are referring to the 
definitions of some so-called geometrical parameters that are defined in purely algebraic way. 
Since these definitions can, formally, be made for any partially ordered set, any such set, 
regardless of its structure, can be viewed as either PST-based or OST-based, as we wish. On 
the other hand, when we say that a given causal set is PST-like, we are saying that if we 
choose to view it as PST-based, then we will find that the causal structure is consistent with 
the one we will get through scattering of points in PST (and, of course, the same statement is 
true if we replace "PST" with "OST"). Thus, a statement that a given causal set is PST-like 
(or OST-like) does say quite a bit about the specific structure of the partial ordering. 

It is also important to notice that the dynamics introduced in Chapter 4 relies only 
on a causal set being PST-based, and it does not assume that it is PST-like. This implies 
that for arbitrary partially ordered set the dynamics will still work in well defined fashion 
but it will not resemble anything we see in a lab. At the same time in a very special case 
of PST-like causal set it will. Even though we target that "very special case", the theory 
doesn't formally rely on it. For this reason, Chapter 4 can be read independently of Chapter 
3. 
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This leads to the following question. Since the dynamics only relies on a causal set 
being PST-based, what is the explanation that the causal set in which we are living in is 
PST-like? This question is by no means new. In fact, the OST-version of this question was 
asked for decades. One way to ask this question is this: since gravitation is a field, and 
that field corresponds to causal structure, there has to be a dynamics that "forces" causal 
structure to look like a manifold on a large scale, but such dynamics was not described yet. 
Unfortunately, there are a lot more problems with viewing gravitational field as field. The 
main ones are the following: 

a) While OST based gravity is described in [1], I have not come up with PST based 
gravity yet (it should be pointed out that this is only due to the fact that PST theory is 
very new, so it is not a serious objection). 

b) OST-based gravity shares the same faults as the version of scalar field Lagrangian 
introduced in (see section 4.1 for the summary of the latter). 

c) Even if gravitational Lagrangian was available, it is not clear how it can be used 
since, once geometry is allowed to fiuctuate, we no longer have the geometrical background 
in which to use Lagrangians. An attempt to adress this issue was made in [5] , but its success 
is very controversial. 

d) Suppose we did have a consistent theory of gravity with Einstein's equation being 
its classical limit, which would imply, for example, zero curvature for vacuum. In this case 
it is still not clear whether the zero curvature implies the presence of manifold structure. 
For example, in the OST situation, where the curvature is defined as in [4j, that assertion 
is not true. Of course, it is possible that the situation would be different in this regard in 
PST scenario. But since no work has yet been done in defining PST curvature, this is still 
a subject of future research. 

e) Even if the dynamics was under control and classical limit of part d implied smooth 
manifold, we still do not know the mechanism by which the geometry "collapses" onto one 
smooth manifold structure rather than another. Again, an attempt was made to adress this 
issue in |5j, but that theory is controversial. 

For the reasons presented above, the dynamics that would generate PST-like causal set 
is postponned for future research. However, there is still a temporary explanation as to why 
our causal set is PST-like, which appeals to the idea originally introduced by David Hume 
([6]), and which I proposed to apply to quantum field theory and causal sets in [5]. First, we 
make the following observation: any kind of dynamical law, such as the most basic classical 
physics, is a result of repeated observations. One might argue that there is no cause and 
effect, but rather there is a pattern of observed correlations that we later interprete as cause 
and effect. Thus, a statement "B happened because of A" becomes a statement "the history 
where A happens and B doensn't is not allowed, while the history where A happens first 
and B happens next is; thus, since we know A have happened, the only allowed correlations 
include event B afterwords". From this point of view, we never explained the occurance 
of B in terms of cause and effect. Instead, we simply postulated the criteria for "allowed" 
correlations, and then occurence of B became a logical {not causal) consequence of that 
postulat, together with occurence of A. 
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The implication of the above philosophy is that the explanation of something is simply 
recognizing the pattern. For instance "the sun is rising this morning because the sun is 
rizing every morning" is a good example of explanation, even though I never used either 
Newtons or Keplers laws to explain why the sun rises every morning. Now,this philosophy is 
perfectly consistent with my eventual use of Newton's laws to explain the behavior of the sun. 
After all, the latter explanation would amount to saying "sun does Y because every physical 
object does Z", where Y amounts to rising every morning, while Z amounts to obeying the 
Newton's gravitation laws. On the other hand, the temporary explanation we had prior to 
that is that sun does X (i.e. rises up this week) because it does Y (i.e. rizes up every day). 
So Y is bigger pattern than X, and Z is bigger pattern than Y. Thus, generalizing X into Y 
and generalizing Y into Z both fit the definition of "explanation". After all, an explanation 
amounts to showing how some smaller pattern can be viewed as part of bigger one. Since 
"smaller" doesn't mean "small", and "bigger" doen't mean "big", it is always possible to 
"explain the explanation" by showing how bigger pattern is part of even bigger one. In fact, 
since any theory is based on axioms, which, in principle, can further be explained by other 
axioms. Thus, there is no such thing as "ultimate explanation". 

In light of this philosophy, the explanation of why causal set is PST-like is similar to sun 
rizing up every morning, while the actual dynamics that generate such a causal set is similar 
to the Newton's law of gravity. Since we claim that it is okay to view "the sun rizes this 
morning because it rizes every morning" as a satifactory theory until the law of gravity is 
discovered, we likewise claim that the explanation "the causal set we are living in is PST-like 
because every causal set is such" is also an okay explanation, until dynamics is introduced. 

Of course, we have all observed PST-like pattern from our everyday experience, so we 
are almost done; but not quite. After all, by PST we mean a continuum background, while 
causal set is discrete. So, our experience presumably teaches us a discretized version of 
PST-like pattern. Thus, in order for that "observation" to count we need to come up with 
rigourous definition of what observation are we referring to. In case of sun rizing up every 
morning, in order for that "theory" to be complete, we have to define what we mean by 
the morning. For example, we can say that a morning is when our watch shows such and 
such time, and then define what we mean by well working watch. Thus, the definition of 
well working watch as opposed to the law of gravity becomes a necessary component of the 
explanation why the sun will rise at a given time. Likewise, in the case of PST-like causal set, 
we have to define what we mean by PST-like; once we do, then we can simply postulate it as 
an "axiom" that the causal set has to be PST-like, and that axiom will be the explanation 
we are seeking. 

Finally, it is important to point out that axiomatization of OST has been dealt with, in 
a lot more rigorous way, in [9J and [10]. In this paper I can't possibly measure up to their 
standard since I am not dealing with continuum, hence I lack a mashinery to do rigourous 
proof. At the same time, thees papers have their own weakness: once the curvature is 
introduced, they become approximations, which break down in the vicinity of light cone. In 
light of the fact that we have addressed the lightcone problem by introducing PST, it might 
be of interest to see if axioms can be invented in order to define, with specified degree of 
accuracy, PST-like causal set, while accommodating curvature. It is also possible that PST 
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version of [9] might be simpler than OST one since in case of a PST the timehke geodesies 
arises in a natural way, so one does not need to introduce intersection of timelike planes 
to define them. However, the axiomatization proposed in this paper is completely different 
from the one of [9] and, therefore, the analysis of the approach of [9] or its modifications 
is beyond the scope of this paper. As far as this paper is concerned, some short cuts were 
made which were not doable within a context of |9], such as appealing to the notion of 
radial, time, and velocity coordinates. This means that, as part of the future research, I can 
abandon these assumptions and still investigate if translating [9J to PST scenario will allow 
the axiomatization without such shortcuts. 

3.2 The philosophy behind the choice of axioms 

Since we are trying to impliment local PST-like topology rather than global one, we 
have to define the notion of locality. In light of continuum, the "local" neighborhood can 
be arbitrary small. In discrete scenario, on the other hand, one requires a certain constant 
that defines the size of generic local neighborhood. This constant can not be imposed on 
Lorentzian distance since, as discussed earlier, most points q that have Lorentzian distance 
to p smaller than e are still arbitrary far from p coordinate- wise. So, instead, the constraint 
is imposed on t, r and v coordinates discussed in the previous chapter. More precisely, I 
postulate a small numbers 5 and /i, as well as a large number A, and define the (/i, 5, A) 
neighborhood of p as a set of all elements q satifying |tp((3')| < /i, rpi^q) < 5, and |fp(q')| < A. 
The reason A is assumed to be large is that we do not have the curvature effects in velocity 
(i.e. PST/OST) directions while we do have them in position (OST) ones. On the other 
hand, the reason A was introduced at all is that in a reference frame that moves very close 
to the speed of light, the length of a single causal link "stretches out" very far coordinate- 
wise. Thus, even if |tp((3')| and rp{q) are both small, this doesn't mean very much since their 
definitions involve "very long" links, and thus their error can, possibly, be of a magnitude of 
several thousands kilometers! Finally, for the sake of convenience of sparing one parameter, 
the two "small" parameters h and 5 will be assumed to be equal, which allows letter h to be 
dropped. 

It might seem that the fact that A is finite (although large) violates the Lorentz covari- 
ance. As explained earlier, this violaltion of Lorentz covariance is similar to the violation 
of translational covariance by a small neighborhood that we are using to from OST-based 
quantum field theory that existed for decades. But, as stated previously, the philosophy 
of this paper is that, while both translational and rotational symmetry is violated by any 
given neighborhood, it will be restored after we consider all possible neighborhoods. In light 
of this philoophy, we are allowed to violate Lorentzian symmetry as much as we like while 
defining the axioms for any given neighborhood. The two particular ways in which we insist 
on violating relativity is introducing finite A and separating space and time coordinates. In 
fact, not only we can do these things, but in fact we should! After all, if we explicitly sepa- 
rate space and time coordinates and still get relativity at the end of the day, this will prove a 
point that relativity arises naturally as a consequence of this approach. On the other hand, 
if we didn't separate space and time, we would have been using minus signs in Lorentzian 
signature to combine them, and then one would argue that we put relativity by hand. 

Now, the easiest way to argue that relativity can be restored after having separated space 
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and time is to use the result of numerical studies done in [13] which shows that in case of the 
random scattering of points on Minkowskian space, there is a very close correlation between 
the longest chain of points and the Lorentzian distance. Thus, by using former instead of 
the latter, we no longer need to remember the minus sign of Minkowskian metric. However, 
we still have to remember the definition of a causal structure in order to be able to list the 
chains of points available. The causal structure is defined in terms of light cone, so we have 
to remember that the speed of light is constant. Furthermore, in order to define speed of 
light, we have to remember the definition of distances on spacelike hyperplane (for flat case). 
That definition relies on Pythagorean theorem. While it is true that in the previous chapter 
the distances were defined without Pythagorean theorem, one can not perform numerical 
studies on generalized spaces, including the one discussed there. 

This raises a question: should Pythagorean theorem be postulated, or should it be de- 
rived? On the one hand, the philosophy of the paper only demands that we refrain from 
postulating relativistic effects; in this respect it is okay to postulate Pythagorean theorem 
since it is not one of them. On the other hand, a Pythagorean theorem is a consequence of 
rotational symmetry, and the latter can be obtained through the commutation of Lorentz 
transformations. In light of this, one might argue that if we "don't know" Lorentz transfor- 
mations, we shouldn't "know" Pythagorean theorem either. However, one might also argue 
that rotational symmetry is only an emergent notion that arises on a large scale. If we are to 
define the discretized version of that symmetry, we would have to introduce a lot of "small" 
constants in order to set up a rigourous criteria of what we mean by approximate symmertry 
(in fact, even though we are not going to take the rout of defining commutation relations, 
we will still have to introduce constants in sec 3.4 through 3.6 while defining other "axioms" 
that are expected to hold "approximately true"). 

In light of this it is the matter of taste as to what approach is more natural. On the one 
hand, postulating Pythagorean theorem requires less axioms and less small constants, so it 
is more natural from the formal point of view. On the other hand, Pythagorean theorem can 
not be explained to the first grader, while the axioms and small constants through which it 
is derived can be; so in this respect they are more intuitive. In light of this controversy, I 
leave it to the reader to choose between sec 3.3 where Pythagorean theorem is postulated 
by hand and sec 3.4 through sec 3.6 where other axioms are postulated in order to derive it. 
In the latter case one can skip sec 3.3 since its results are not used in sec 3.4 through 3.6. 

In fact, if one is not happy with the philosophy of postulating emergent smooth geometry 
at all, one can skip the rest of this chapter altogether, and still read chapter 4. As mentioned 
earlier. According to philophy of this paper, the quantum field theory proposed in Chapter 4 
is applicable to all causal sets, not just PST-like ones. While it is true that a very special case 
of PST-like causal sets was used to motivate the setup of quantum field theory, the latter is 
formally defined without an assumption of the structure of a causal set, and is meant to apply 
to all causal sets. Furthermore, when PST-like structure is assumed in motivational part, one 
is only assuming the Poisson scattering in continuum PST background; thus, neither of the 
axiomatization approaches are used. In light of this. Chapter 4 is completely independent 
of Chapter 3, and one might skip Chapter 3 entirely if one wishes to do so. 

3.3 Defining PST-like causal set while making use of discretized Pythagorean 
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theorem 



As explained in the previous section, we are going to separate space and time coordinates, 
and impose Pythagorean theorem on the space ones. Thus, the OST coordinates can be 
represented by two-point functions x: S x S — ?> M.'^"^ and t: S x S — )> M, that satisfy 
^p{q) ~ \^{PtQ)\ ^-iid tp{q) ~ \t{p,(l)\- The PST/OST coordinates, on the other hand, are 
represented through the "velocity" function v: S x S ^ W^~^, satisfying \v{p,q) \ ~ Vp.n{<l)- 
These are summarized in part b of the definition below. One has to remember to enforce 
transitivity, that is, if 

^(a, 6) fti ^(a, c) PS v (6, c) w (19) 

then 

x{a,c) x{a,h) + x{h,c) (20) 

and 

t{a,c) ^t{a,b) +t{b,c) (21) 
This corresponds to part c of definition below. 

Now, the above "transitivity" relations only apply to the PST elements (or, equivalently, 
OST vectors) whose respective velocity relative to each other is close to 0. In order to include 
other PST elements we have to impose a constraint that if in the frame of PST element 6, 
the OST coordinates of PST element c are close to 0, then, in the frame of element a the 
OST coordinates of PST elements h and c are nearly the same. This will allow us to use 
PST element h whose velocity relative to PST element a is small in order to assess the OST 
coordinates of c, whose velocity relative to a might be large. However, it is important to 
notice that while the velocity of c relative to a might be large, it can't be too close to 1 since, 
in this case, due to relativistic effects, the small shifts due to Poison scattering in one frame 
might look like very "long" in another one. So for this reason the approximate statement 
only applies when Vh{c) < A, where A is some fixed real number that is very close, but 
smaller than, 1. That statement is included, in a more explicit form, in part d of definition 
below. 

The expected Lorentzian distances statistically arise only if the density of scattering is 
uniform, so it is crucial to postulate the latter. In order to do that, we have to first define 
density, and then constrain it to a range between p— A to p+A for some specified A << p < 1. 
While it is possible to define density in terms of Poisson distribution, e~''p"'/n\, I choose not 
to do that since this equation can be derived analytically from much simpler definition: the 
density is the number of very small volume elements that contain at least one scattered point. 
Now, if we view point p as a local origin, in order for theory to be transitive, we would like to 
define density in the 5-neighborhood of p, rather than p itself. In particular, we would like to 
select t, X, and v, satisfying max{\t\, \x\) < S, and \v\ < A, where 0<5<A<1, 5~0 and 
A ^ 1. We would then like to define the notion of probability of finding an element q whose 
coordinates relative to p, {f, x', v') satisfy \t' — t\< xS, \x' — a;| < x5, and \v' — v\ < x5. We 
can define this probability by simply counting the number of all such small regions and see 
how many of them actually contain an element of PST. 

We would formally accomplish this by defining B^^ to be a set of quadruples (p; t,x,v) 
(where max{\t\, \x\) < S and I^TI < A), for which there is at least one element of S with 
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coordinates {t'.x'.v') that differ from {t,x,v) by less tlian x^- We tlien put a restriction on 
tlie measure of sucli set, < iiBj < ii2- All of this will be done in part / of the definition 
that follows: 

Definition: Let d be an integer. Causal set S is PST-like up to (d; 6, e, fii, H2, n) if there 
are functions x: S x S ^ R'^~^, u: S x S ^ M"*"^ and t: S x S satisfying the following 
conditions: 

a) t{p,p) = 0, x{p,p) = 0, and u{p,p) = 

b) For any p and q such that < S, \rp{q)\ < 6, and < S, the following is 
true: 

tp{q)-€5 <\t{p,q)\<tp{q)+€S (22) 

rp{q) - e6 < \x{p,q)\ < rp{q) + e5 (23) 

Vp-M -eS < \v{p, q)\ < Vp.n{q) + (24) 

c) If p, q and r are three elements and maxdvip, q)\, \v(p, s)\, \v{q, s)\) < 5, then \t(p, q) + 
t{q, s) — t{p, s)\ < e5 and |r(p, q) + r(g, s) — r{p, s)\ < eS 

d) If maa;(|ra(&)|, |^a(^')|) < max{\ti){c)\, |r6(c)|) < 5^ and ■max{\va{b)\, |i'6(c)| < A, we 
have \x{a, c) — x{a, b)\ < eS and \t{a, c) — t{a, b)\ < eS 

e) If p, q and s are three elements oi S, s & G{q), \rpq\ < 5, \tpq\ < 5, and \vpq\ < A, 
then 

|^(P) •s) — x{p, q) — v{p, q)t{q, r)\ < eS (25) 

f) Let Bg be a set of all quadruples {p,t,x,v) satisfying max{\t\, \x\) < 5 and I^TI < A 
for which there exists at least one q satisfying \t{p, q) — t\ < S'^, \x{p, q) — x\ < 5^, and 
\u{p, q) — u\ < 5"^. The measure of the set Bs, defined as above, satisfies 

Hi < hBs < H2 (26) 

3.4 Time like coordinate parameters 

In light of the fact that PST element corresponds to timelike vector on OST, we claim 
that a local choice of d elements of PST spans a local OST region. Furthermore, in hght 
of the fact that velocities are defined in terms of OST parameters (time and distance), the 
above choice of d PST elements can ultimately be used in identifying PST/OST coordinates 
as well, which will also be called velocity coordinates. As was also true in OST case, it should 
be understood that, in light of discreteness, the point- wise definition of these statements is 
not true; thus, the notion of spanning can only be defined in statistical way. 

One of the fundamental building blocks of coming up with definition of spanning is the 
idea that the definition of relative velocity in section 2.6 allows us to define the notion of "co- 
moving" elements of PST (or, equivalently, "parallel" OST vectors) as the ones with relative 
velocity close to 0. The fact that relative velocity was defined in non-local way, implies 
that the notion of co-moving vectors is equally non-local, and does not require the notion of 
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parallel transport in order to be well defined (but in practice only the vectors within OST d- 
neighborhood of each other form coordinate system that has physical meaning, which means 
that the word "non-local" should be replaced with "quasi-local"). This choice was motivated 
by the fact that in order to define discretized parallel transport one would have to introduce 
a sequence of parallel elements such that each next element of the sequence is "very close" 
to the previous one. This requires one to be able to define a notion of "nearby" elements 
being parallel to each other. Now, if two elements are trully nearby, there might be too much 
random fluctuation to mess up the parallelism we are seeking. On the other hand, if the two 
elements we are interested in are separated by a distance of order S then, even though S is 
assumed to be small, there is enough intermediate elements for the statistical fluctuations to 
average out, which makes the notion of parallelism more reliable. Thus, ironically, the quasi- 
local notion of parallelism becomes more reliable than parallel transport, which is based on 
the local notion. 

Of course, it is possible to compromise: instead of choosing between sequence of elements 
" next to each other" or no sequence at all, we can construct a sequence of elements within 
a distance 5^ of each other. On the one hand 5^, as small as it is, still contains a lot of 
elements and thus it is statistically reliable. On the other hand, this resembles parallel 
transport. This, however, unnecesserely complicates the definition. At the same time, for 
the purposes of the paper, it is sufficient if "parallel" elements are only parallel up to S 
rather than 6^. For these reasons I chose to abandon the idea of the sequence of elements 
(or parallel transport) and stick to quasilocal picture described earlier. 

Now, lets go back to the definition of coordinate system. In light of the fact that 
each PST element corresponds to the timelike vector in OST, all of the coordinate axes are 
timehke. Thus, a point in OST defined by these coordinates can be denoted as {ti,...,td). 
After having done that, the discretized version of linear combination can always be used to 
convert timelike coordinates to spacelike ones. Now a point in OST corresponds to a subset 
of PST (or, in other words, subset of a causal set S). That subset, of course, will depend on 
the choice of the origin (which corresponds to the OST position of the element po) as well 
as coordinate axes, which corresponds to the elements pi, ...,pd- 

Finally, in order to take into account the dependence on the small OST parameter 5^ 
and large PST/OST parameter A, we put both as a lower indices of F. The parameter 
(5^ was introduced in order to accomodate the fiuctuations due to discreteness. In light of 
the curvature, everything is constrained to the neighborhood of the size that is first order 
small (in particular, are first order small, as well). Thats why the allowed error in their 
measurement is second order small, which is why this is denoted by S'^ rather than S. In 
order to save letters in the alphabet, S will continue to be used to denote the first order small 
size of the local neighborhood. By making S sufficiently small, one can make sure that 5^ 
satisfies expected properties of second-order-small parameter, without a need of multiplying 
it by any coefficient. Finally, in order to define parallelism, we need to use the definition of 
non local relative velocity, provided in section 2.6. That definition was relying on an integer 
constant n. This gives us a final notation for an OST point: Fa ,52^„(po;pi, ■■■,Pd',ti, ■■■,td)- 
As stated earlier, a point in OST is a subset in PST. In light of the fact that S is PST, this 
is a subset {not an element) of S. 
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Now, in light of the curvature, the OST translations do not commute. Therefore, in 
order to define Fj<^s'^niPo'iPi, ■■■,Pd'iti, ■■■,td) we have to specify in what order the above 
mentioned translations are made. Roughly speaking, in order to arrive at any element of 
the set -pA,52,n(po;Pi, ■■■,Pd]ti, ■■■td), one has to start from the location of pi and follow the 
direction of po for the time duration ti, then one has to follow the direction of p2 for the 
time duration t2 and keep going in this fashion, until, in the last step, one has to follow 
the direction of p„ for the duration The words "follow a direction of pk" imply a future 
geodesic translation by using some element "parallel to" Pk, where the quasi- local notion of 
parallelism, discussed earlier, is used. A couple of pages later we will postulate approximate 
transitivity, which is roughly 

{t,,...,U) + {t[,...,Q ^ {h + t[,...,U + t'a) (27) 

and show that it implies approximate commutativity, that is, 

-^A,52,n(Po; ■■■,Pi, ■■■,Pj, ■■■,ti, ...,tj, ...) ^ -FA,52,n(Po; ■■■,Pj, ■■■,Pi, ■■■] ■■■,tj, ...,ti, ...) (28) 

where approximation is defined in appropriate sense. This means that arbitrary choice of 
order in which direction to go first and which to go next has only negligeable impact on the 
parametrization that we obtain. 

Now, lets look a httle bit more closely at the sequence of steps we just described. Suppose 
we first made a step in a direction pi and then p2. In this case, at the very end of the first 
step and at the very beginning of the second step, same location according to 

OST, but not according to PST. Thus, from PST point of view, we really made three or four 
steps. First we started at qi. Then we moved to q'2, which is a future geodesic translation 
of qi. Since ^2 is parallel to pi, in order for our next step to be parallel to p2 we need to 
rotate to q2 which is parallel to the latter. Then we make a step in the direction of 52 ■ We 
could stop here, in which case we have made three steps. Or, if we want, we can perform 
an additional rotation after that step since the definition of F(A, 5^, n)(pi,p2; ^i, ^2) allows 
it. Likewise, at the very beginning, before making the step from qi to q2, one could have 
possibly made a step from q[ to qi, which turins it into total of 5 terms. In any case, we had 
no choice as far as the rotation between first and second step is concerned. Thus, we had to 
make at least three and at most five steps. 

Similarly, the n-step OST sequence of points will become 2n-step sequence from PST 
point of view. In this sequence, qk is a geodesic future translation of q'f^_i, and q'^. is a 
rotation of qk- Now, if qk and q'f, move too close to the speed of light with respect to 
each other's reference frame, then, as a result of relativistic effects, the small fluctuations 
in OST-projection of Poisson scattering will result in very large OST fluctuations of qk in 
the reference frame of and vise versa. In order to avoid this, their relative velocities are 
bounded by A. It should be understood, though, that A can be very large, possibly close to 
the speed of light. This brings us to the following definition: 

Definition: Let po;pi, ...,pd be elements of causal set and let ti, ...,td, as well as A and 
6 be real numbers. Then F\^s,n{pi, ■■■,Pd', ti, ■■■id) C 5 is a subset of S defined as follows: s is 
an element of -FA,5,n(Po;Pi, ■■■,Pd;h, ■■■td) if and only if there exist qi, ...,qd+i and q[, ■■■,q'd+i 
satisfying the following conditions: 
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a) Vp.{qi) < S 

b) max{\r,Ml\tM\)<S' 

d) q-+i G GtXqi) 

e) q'o = po and g^+i = s 

Note that in the above definition the velocity is bounded by S while the position by S'^. 
That is due to the fact that the local neighborhood constraints velocity to A and position 
to 6. Thus, as far as velocity is concerned, 6 is small compared to A; and, as far as position 
is concerned, 5^ is small compared to S. 

Since each of the PST elements has both well defined position and well defined velocity, 
we can get rid of element po by allowing pi be used for both po and pi. More specifically, 
its OST projection will serve the purpose of po while its PST/OST projection will serve a 
purpose of pi. 

FK,5,n{Pu-,Pd]ti-,--td) = Fa^sApi'^Pi^ ■■■^Pd;ti, --td) (29) 

Of course if we do that, then even in the flat scenario, F^ s'^^pa-i^, ...,pa/, Ua^^ Ua^) depends 
on the choice of ai, although it is true that, for any given oi it is independent of the choice 
of a2, What comes to our rescue is that, in light of the curvature, we have already 

allowed the error e5 in our definition. As a side benefit to that, if we make sure that these 
elements are separated by the distance much smaller than e5, they would commute. 

We would now like to define what it means for that choice of PST elements to span the 
subset T C S. This will be defined as follows: 

Definition: Let {pi, ...,Pd} and T be subsets of S, let A, 5, e and a be real nubmers 
and let n be an integer. We say that pi, span T up to (A, S, e, a, n) if the following is 
true: 

a) For any element q & T, there is at least one choice oiti, ...,td, satisfying max{ti, ...,td) < 
5 such that q e FA,5'^,n{pi, ■■■,Pd;ti, ...,td) 

b) Let U be defined as follows: q is an element of U if and only if there is at least one 
element q', and at least one choice of ti, td, t[, t'^ and i'/, t'^ such that 

(i) q e Fa^sApo'^Pi: --^Pd^tu ...,td) 

(ii) q' e FA,5,n{Po;Pl,-;Pd;t[,...,t'a) 
(in) q e FA,5,n(g';Pi,...,Pd;ti, •••,td) 

(iv) There is at leat one k such that \tk — t'^ — t'l\ > e6 
Then fiU < a^T. 

The purpose of part a is to say that the choice of basis is complete. We note that in the 
above definition no accommodation for "missing" some elements of T due to discrete fiuctu- 
ations was made. That is because the definition of Fs2 a{pq',Pi, ■■■,Pd', h, td) incorporated 
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the "width" 5^ of the paths involved. Thus, while the paths of one-element width might 
miss a lot of elements, the paths of width 5"^ would not (in light of the constant tq in the 
definition of distances, one-element width is the same thing as the width tq and it is assumed 
that To << 5^, which means that width 5"^ corresponds to large number of elements). In fact, 
saying that a path of one element width would miss a given element but still hit its vicinity 
is equivalent to saying that a path of larger width would hit that element itself. That is why 
only one of the two provisions is necessary, but not both. 

As far as part b is concerned, a is assumed to be a small constant. Thus, U consists 
of rare elements that are only there due to bizzare discrete fluctuations, and in continuum 
case won't exist at all. Now, (i), (ii) and (iii) are just the definition of notation, and they 
are summed up in the assumption 

{h,...M)^{t\,...,Q + {tl...,Q (30) 

which, of course, we are allowed to make. So the key criteria for U is (iv), which says that 

(t;, Q + {t'i...Q ^ {t[ + t'i, ...,t', + Q (31) 

where the criteria for ^ is e5. After all, since we are restricting ourselves to small 5 neigh- 
borhood (in order to avoid curvature), any meaningful approximation has to be an approxi- 
mation up to something second order small, such as e5. Thus, when we say that U contains 
very few elements, we are implying that in most cases 

(t;, Q + {t'i...Q ^ {t[ + t'l t', + Q (32) 

up to e5. This, in particular, implies commutativity. For example, if d = 2, then 

i^A,52,n(ri, rs; ^1,^2) = (^i, 0) + (0, ^2) (33) 

and 

i^A,52,n(r2, n; ti) = (0, i2) + (ti, 0) (34) 
Then, if we apply transitivity to both equations we get 

FA,5^,n{ru r2] tut2) = (ii + 0, + t2) (35) 

and 

FA,5^,n{r2: n; ^2, ^l) = (0 + ^1, ^2 + 0) (36) 
The lat two equation, of course, imply the desired result 

-^A,52,n(n, r2; tl, t2) = -FA,52,n(?^2, n; ^2, ^l) (37) 

Thus, in the above derivation the only non-trivial step was the tranitivity. Thus, the fact 
that we have postulated transitivity as part of our criteria assures commutativity which 
in turn. As mentioned earlier, the latter is a key ingredient of the meaningfulness of the 
definition of coordinate system - after all, if it wasn't for commutativity there would be no 
justification for the particular choice I have made on the sequence of transitions. 
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We have just defined what it means for the set of elements to span a certain region. Now, 
do we intend to postulate that most choices of d elements span a region? Of course not! 
In fact, the vast majorit of such choices are separated arbitrary far from each other, which 
means that the curvature does not allow this to happen. We do want to claim, however, that 
most local selections of d elements span a region. We formally define the set of all possible 
local choices of d elements of S by branching of S, as follows: 

Definition: Let 5" be a causal set and let A and S be real numbers and let n and d be 

an integer. Then [d; 1^,8, n) -branching of S", which is denoted by Bd-\^s,n{S) is a subset of 
S'^, denoted by B(i-A^s,n{S), and is defined to be a set of all d-tuples of elements {pi, ■■.,Pd) 
satisfying 

max{\tpXPj)\,rp^{Pj)) < S (38) 

and 

max{\vp^.n{Pj)\) < (39) 

Now we would like to define what region are we wanting the branching to span. We 
would define that local region to be a cylinder around one of these elements of a specified 
size, where a cylinder, whose axis coincides with the direction of OST vector corresponding 
to that particular PST element. This cyhnder is defined as follows: 

Definition: Let 5* be a causal set and let h, r and A be integers. Then, for any p E S, 
the cylinder Ck^r;A{p) consists of points q for which \tp{q)\ < h, \rp{q)\ < r and |'i'p(?)| < A. 

In order to save letters, we will assume h = r = 6 from now on. We now will define 
the set of d-tuples that map all of the d cylinders defined by each of its elements to be map 
generators: 

Definition Let pi, ...,pd be elements of S, let A, S, and a be real nubmers and let n be 
an integer. We say that {pi, is a map generator of S up to (A, S, a, n) if for every k, 

{pi, ...,Pd} spans Cs,5;a{p) up to (A, 6, e, a, n). The set of all map generators up to (A, S, a, n) 
will be denoted by m{d; A, 6, a, n) C S'^ 

Finally, we are ready to define what we mean by S being d-dimensional: 

Definition Let A, 5, and a be real nubmers and let d and n be an integers. S is said to 
be d-dimensional up to (A, 5, cr, n) if 

^{Bci.A,5,n{S) \ md;A,S,<T,n) ^ ^ ^^q^ 
^Bd-A,d,niS) 



It is important to note that in standard causal set literature the notion of dimension 
is defined in a much looser sense and it relies on various statistical properties which are 
well defined for all causal sets: both manifoldlike and non-manifoldlike ones. However, for 
the purposes of this paper, that is useless as far as defining the notion of PST likeness is 
concerned. Thus, much stricter criteria for a dimensionality is introduced, which singles out 
a very small class of causal sets; the PST-like causal sets will be defined as even smaller 
subclass of that class. 
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3.5 Distance properties of PST extension of 1+1 dimensional submanifold of 
OST 



Despite the fact that we have found a way to make sure that causal set has local d- 
dimensional OST coordinate system, this does not yet prove that it is either OST-like or 
PST-like from metric perspective. In fact, we are yet to see whether simple things like 
Pythagorean theorem or Lorentzian geometry hold. As stated earlier, we would like to avoid 
actually postulating Pythagorean theorem or any other ingredients of the notion of a metric, 
and instead postulate some geometrical notions that are more intuitive to grasp, and select 
them in such a way that they ultimately imply the sought-after properties of a metric. 

We know that if we consider scattering of points in 1 + 1 dimensional OST, we will get 
the expected properties of the Lorentzian distance. In the PST case, in light of the fact that 
the fictitious photon moves with near-lightlike velocity within a very short time interval of 
its imaginary emission, the result of OST-based causal set theory is carried over on a larger 
scale. This suggests that we can employ some constructions in d dimensional OST that 
involves 1-1-1 dimensional OST planes (each of which will then be extended to PST through 
tangent vectors) in order to use the 1 + 1 dimensional geometry as a stepping stone to obtain 
the d dimensional one (here, as well as throughout remainder of this paper, when we speak 
of certain number of dimensions, we are referring to OST ones, unless otherwise specified). 

Now, in 1 + 1 dimensions it might still not be as easy as it seems. In order to be able to 
talk about 1 + 1 dimensional Poisson scattering, we have to know what we mean by " uniform 
density". In order to say that density at {ti,xi) is the same as density at {t2,X2), we have 
to say that a square defined by 

{{ti-St/2,xi-5x/2), {ti-St/2,xi+5x/2), {ti + St/2,xi-Sx/2), {ti+St/2,xi + 6x/2)} (41) 

gets as many points as the square defined by 

{ (t2 - St/2, X2 - 6x/2) , {t2 - 6t/2, X2 + 6x/2) , (ta + 5t/2, X2 - 6x/2) , (ts + 5t/2, X2 + 6x/2)} (42) 

It is very important that these squares are much smaller than the original 5-size neighborhood 
we are concerned about. After all, if we wanted to, we could have replaced t and x by sin t 
and sin x in the above definitions; this would, clearly, imply non-uniform density. Now, if 
we were to look at the number of elements received at the square of size 5, then there would 
be no way to distinguish the two. But, by partitioning 5 into the smaller intervals of size 5^ 
we can, in fact, distinguish these two cases: in one case we have 



as defining features of the size of small square. Now since we have postulated transitivity in 
the last section, we can focus on a single square around local origin. 



t — mS"^ , X — nS^ 



(43) 



and in another we have 



sin t = m6^ , sin x = n6^ 




{{-St/2, -Sx/2), {-St/2, Sx/2), {St/2, -Sx/2), {St/2, Sx/2)} 



(45) 
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and the rest will follow by transitivity. 

To transform the above considerations into rigourous definition, we have to define a class 
of two dimensional subsets of S (or, in other words, plane patches embedded into higher 
dimensional spacetime) which, on average, suppose to receive equal number of elements. We 
proceed in the following way: first, as expected, we select two PST elements (or, equivalently, 
two OST vectors) p and q to define two-dimensional coordinates of the plane in which the 
parallelogram sits. If we were interested in the OST topology, rather than the PST one, we 
could build the parallelogram out of " thickened points" , where by a " thickened point" we 
mean a region of the size S^. Thus, the parallelogram would look like this: 

t^A,52,n(P,?) = U -^A,53,n(P,?;ii,4) (46) 

Here, the size of entire region is 5 in order to avoid curvature, the size of small sample square 

is S"^ in order to distinguish t and x from sin t and sin x (see discussion above), and the 
size of thickened point is 5^ in order for it to measure the structure of the square of size 5^ 
precisely enough (after all if thickened point had a size 5^ we would have an error of the 
order 5^ in measuring the size of the square, and thus we could mistake a square of size 5^ 
with a square of size 25^ and get the density off by a factor of 2) . 

Now, since we are interested in PST topology, we have to add similar restriction on ve- 
locities. On the one hand, the smallness of the local neighborhood imphes that the velocities 
have to be restricted to the range of a very small width, 5. On the other hand, similar to 
space displacement mentioned in the previous paragraph, that range can be displaced from 
velocities of p and q by very large amounts, as long as that displacement doesn't exceed some 
large number A (as before, A is assumed to be very close to speed of light since there are no 
curvature effects in velocity direction; at the same time, A is assumed to be slightly less than 
c in order to avoid the situation when the spacing between points in Poisson distribution 
appears to be large due to relativistic effects). Thus, we need some third vector, s to define 
the velocity range 

Vn{s-5) = {e\\vs;n{(^)\<5} (47) 
Therefore, the generic parallelogram becomes 

M^A,5,n(P> ii, t2, 5) = UA,S^,n{P: Q', ^2, S) D Vn{s; 5) (48) 

where it is understood that 

max{\tp{s)\, \rp{s)\, \tg{s)\, \rg{s)\) < 5 

max{\tpq\, \rpq\} < Se (49) 

Then, in order to obtain 1+1 dimensional geometry, we will go ahead and define the average 
of 1 + 1-dimensional "densities" of Poisson distribution over all of these 1+1 dimensional 
parallelograms. This, of course, would not be possible in the continuum scenario, but it can 
be accomplished in discrete case: 

PA,n(*l, 5) = — (50) 
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Finally, we impose a constraint that the global average of the density, as defined above, 
is restricted to some, possibly narrow, range of numbers. This, together with transitivity, 
implies a uniform probability density on each small 1+1 dimensional region. At the same 
time, the fact that the constraint is only imposed on global average of the density, allow for 
large local fluctuations, just as normally expected of Poisson distribution. 

This, however, is not enough to obtain relativity on a timelike two dimensional surface. 
The other piece of information we need is a constant speed of hght on that surface. Since 
both of our coordinates are timelike, in order to deflne speed of light we use the Lorentz 
transformations 

t = llh + 72^2 ; X = JlViti + 'j2V2t2 (51) 

Now, since the relation between 7j and Vi is derived by means of Pythagorean theorem, which 
we are not supposed to know, we are forced to replace 71, 72, 71 Wi and 72 f 2 with ki, k2, 
and A;4 respectively, where the latter are four independent numbers. 

Furthermore, due to the random fluctuations, two things will happen. First of all, the 
lightcone is not exact but only approximate. And secondly, the approximate light cone is not 
true all the time but only most of the time. Thus, we define a "linear portion" to be a set 
of pairs of points for which the approximation holds, within a specified degree of precision, 
and then we simply say that out of the sample of all pairs of elements that are close to each 
other, the percentage of the ones that don't belong to "hnear portion" is very small. This, 
formally translates into two definitions: 

Definition: Let 5 and e be real numbers and n be an integer. A linear portion of S 
based on (A, 5, e, n) is a subset of 5" x 5", denoted by LA,5,e,n{S) which consists of all pairs 
(p, q) for which one can find constants ki, k^ such that, whenever a and b satisfy 

max{\rp{a)\, \tp{a)\, |r,(a)|, \tg{a)\, \rp{b)\, \tpib)\, \rg{b)\, \tg{b)\) < 5 (52) 

and 

min{\vp.n{a)\, \vq-n{a)\) < A (53) 
the following statements are true: 

a) If a -< b, then tp{a) < tp{h) and tq{a) < tq{h). Likewise, if 6 -< a, then tp{b) < tp{a) 
and tq{b) < tq{a) 

b) If either a -< b or b ~< a, then 

\h{tp{b) - tp{a)) + k2{tq{b) - tq{a))\ > Mtpib) - tp{a)) + k^{tg{b) - tq{a))\ - ed (54) 

c) If a and b are causally un-related, 

\ki{tp{b) - tp{a)) + k2{tq{b) - tq{a))\ < \k3{tp{b) - tp{a)) + ki{tq{b) - tq{a))\ + e5 (55) 

Definition: Let S be causal, let A, 5, e and p be real numbers, let n be an integer, and 
let L/^^sA^) linear portion of S based on {A,5,e,n). Then the set S is said to be linear 
up to (A, S, e, cr) if 

tl(-B2;AA»('S') \ LA,S,e,n{S)) ^ ^ ^^^^ 
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Now lets go back to the coefficients ki,...,k4. As mentioned earlier, these coefficients 
correspond to 71, 72, 7ifi and 72^2. The reason they are viewed as completely independent 
of each other is that we are not allowed to use Pythagorean theorem, which means we don't 
have means of doing the usual derivations that are used to prove the dependence of 7 on i>. 
However, there is a geometric way of making this independence compatible with what one 
would expect of scattering when Pythagorean theorem does hold. Namely, the dependence of 
7 on f only holds if we assume g^^v^v'^ = 1. Thus, the independence of the four coefficients 
is equivalent to abandoning that assumption. This means that we can, statistically, enforce 
the expected dependence of 7 on ^; by "constraining" the "linear density" of our paths to be 
similar. This can be done with the following definition 

Definition: Let p, a, S and e be real numbers. A causal set S is linearly uniform up to 
((7, 5, e) with linear density p if, for every m < 5 {both positive and negative), 

n{p,q)\q e a {f{p,q) < (p- vf(p,g) > {p + e6)\t^{q)\)} 

where r(p, q) is the largest possible n for which one can find a sequence p ~< ri -< ... -< r„_i -< 
q and f(p, q) = maxijij), q),T{q,p)). 

Up to this point we were rightly assuming that distance is defined by longest path. But 
we ignored the fact that some of the paths that connect two PST elements on a plane might 
not lie on that plane. In order to fix that, we have to postulate that as long as the two 
elements lie on an OST 1 + 1 dimensional plane, the longest possible path that connects 
them lies on that 1 + 1 dimensional plane as well. Equivalently, this means that the longest 
path between two timelike separated elements lies on the intersection of all possible OST 
planes that pass through these elements. We will do this by first defining a notion of a pair 
of elements for which it is true in its neighborhood, and then postulate that most (though 
not all) of the elements of S are such elements. Thus, it boils down to two definitions: 

Definition: Let p and q be elements of S, let n be an integer and let A, 6, and e be 
reals. Then (p, q) is said to be shortcut free up to (A, 5, e, n) if whenever there is a sequence 
of elements ri -< ... -< rm such that ri and are both elements of F5,A,n(p? Q'l ti^h), there is 
also a sequence of elements Si -< ... -< Sm' such that ri = Si, r^ = Sm', Sk G F\^(^fS),n{p, q) and 
Tom' > Tom — ed, where tq is an atomic scale mentioned in 2.1 (it was introduced in order to 
say (5^ << S). 

Definition: Let A, 6, e and p be reals, and let n be an integer. Let A/^^s,t.n{S) be a 
subset of B2-A,s,n consisting of pairs of elements that are shortcut-free up to (A, 6, e) Then 
causal set S is said to be shortcut free up to (A, 5, e, a) if 

^{B2;A,5AS)\Aa,sAS)) ^ ^ (5g^ 

3.6 Distances in d OST dimensions 

We are now going to try and see whether we can go from two dimensions to d dimensions 
based on just what we have; or, if not, what additional axioms should we add. In order to 
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do that, we are going to make some loose jumps and make a geometrical argument that we 
would have made if we knew we were dealing with usual continuum geometry, and then we 
will go back and postulate just enough of the stuff that we need to be able in order to make 
the argument we just heard ourselves making. 

We have already established that, up to some statistical fluctuations, if the PST-based 
causal set S satisfies all of the above definitions then there is a local (in terms of OST) 
choice of d elements of PST which can be viewed as timelike vectors in OST that form a 
basis of a coordinate system. Lets call them Vq, v'^_^. They can be thought of as timelike, 
non-orthogonal, version of vierbines. 

Now, we are going to do a sort of Gramm Schmidt process. We start with out original 
element of interest, 

d-l 

Here, Einstein's summation convention was not used because Opo- are just coefficients, and 
they are not part of the vector or tensor. Now, we let w^_2 be a projection of w'^_^ onto 
the d — 1 dimensional hyperplane spanned by Vq through v^_2- Then we can view the space 
spanned by w'^_^ and w^_2 as a two dimensional plane. We then use the statistical argument 
on a plane to show that the distances work as expected, and, in particular, w'^_^Wd-i-^ = 
'^d-2'^d-2;M ~ ^d-i^d-i;ii, whcrc e'^_-^ = w'^_^ — w'^_2- Then, we similarly define w'^_^ to be a 
projection of w'^_2 on the space spanned by Vq through v'^_^ and repeat similar argument, 
and keep going until we reach 0. By induction, it is clear that we will get the desired metric. 

Now, lets go back and look at some of the gaps in the above argument that we are 
going to fill with axioms. Since we have already established that on the plane the geometry 
works, the only time we made a lose gap was before we knew we were dealing with a plane. 
One assumption that was made was that there is a unique PST element (or, equivalently, 
OST vector) Wd-i corresponding to the linear combination of different elements Vk- Now, 
as was extensively discussed earlier, it is true that one can travel specified time intervals 
ttd-i-k in the directions parallel to Vk (in terms of non local relative velocities of section 2.6 
as opposed to parallel transport), in specified order. However, while this would get one to 
a specific location, this, technically, has nothing to do with producing PST element w'^_^ 
since the latter is a vector at a point in OST, not a segment between two points. Thus, we 
have to assume some kind of one to one correspondence between segments and vectors that 
point in the direction given by that segment. This is accomplished by imposing the following 
restriction: 

Definition: Let 5" be a causal set and let A, 5 and e be reals and let n be an integer. S 
is said to be exponential up to (A, 5, e, n) if for any p & S and q & S satisfying 

max{\rp{q)\, \tp{q)\, \rg(p)\, \tg(p)\) < 5 (60) 

and 

max(\vp.n(q)\, \vq;n(p)\) < ^ (61) 
there exist p' and q' such that both p' e G{q') and q' e G{p') holds, which satisfy 

max{\rp{p')\, \tp{p')\, |v(p)|, |V(p)l) < (62) 
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and 

max{\rg{q')\, |V(?)I) < (63) 

In the above definition, p' and q' are rotations of p and q, respectively. So, what it 
says, is that there is a way to "rotate" both OST vectors to "line up" with the geodesies 
that connects them. While in smooth geometry, p' G G{q') and q' G G{p') are equivalent 
statements, in discrete case, as mentioned at the end of section 2.4, such is not the case. 
That is why in the above definition we had to explicitly say that both of them hold. 

Now, after we obtained w'^^-^ another assumption was made. Namely, it was assumed 
that one can take a projection of onto a. d — 1 dimensional hyperplane spanned by 
Vq through v'^_2 and also that that projection is unique. Furthermore, we are using the 
"perpendicular" direction of the projection operation as one of the coordinate axes; this, 
of course is a stronger statement than uniqueness, so once we take care of that, we will 
get uniqueness automatically. So we need two definitions. The first one is a definition of 
projection, and the second one is a definition of a certain property of projection that assures 
that it can be used as one of the axes. Now, in hght of discreteness we don't really want 
to say that projection is unique. Rather, we would like to say that if two different elements 
are both good candidates for projection, then they are close to each other. This means 
that projection should be defined as a subset (not an element) of a set T we are doing the 
projection onto. This subset conists of elemnts that are almost coinciding with each other, 
and each of these elements approximates what we mean by projection. It is officially defined 
as follows: 

Definition Let T be subset of S, p be an element of S, let A, e, 5, x ^ be reals, and 
let n be an integer. A projection of p onto T, which is denoted by Pp-n,A5'^,e^,x^,x'^{T) C T" is 
a subset of T consisting of elements q such that 

a) qeT 

b) If q' G T, Vg-niq') < A and ■max{\tg{q')\,rg{q')) < 5^ then rq/(p) > rq{p) + 

c) If max{\tq{q')\,rq{q')) > and Vg.n{q') < A then there is at least one q" such that 
max{\tq'{q")\,rg>{q")) < 5^ and ^(p) < rq{p) - 

d) tq{p) < £2 

The above definition is taylored for a specific case where T looks like a timelike hy- 
perplane, even though it weren't explicitly stated. In part b of the above definition, q' is a 
Lorentz rotation of q. If q points in a direction that is parallel to the plane, while the segment 
connecting p and q is perpendicular, it is easy to see that due to Lorentz transformations 
we would expect rq'{p) > r^^p) in a fiat continuum caes. So, if we take into account the 
curvature as well as fiuctuations due to discreteness, we instead get rqf{p) > rq{p) -\-e^ which 
is a somewhat weaker statement. 

On the other hand, part c of above definition talks about a scenario where q' is a 
displacement of q as well, rather than pure rotation. In this case, due to translation the 
Lorentzian distance between p and q' is smaller. Now, this doesn't necessary say anything 
about the comparison of r or i coordinates since we don't know in what way q' is rotated 
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relative to g, if at all. However, we do know that if we explore all possible rotations of q' 
we will be able to make r smaller than if we were explored all possible rotations of p and q. 
Now since in the latter case g, itself, is assumed to minimize r (due to part a), thats why in 
part b I was comparing q" to q instead of q" to, say, q* . 

Now, in light of A part b might not be true if q' is rotated relative to q by an angle much 
greater than A since in this case we won't have enough range to rotate q' to the desired vector 
q". However, things do work if all the elements of T have relative velocity with smaller than 
A with respect to each other. Such will be the case for -Fa,52 ,j(po;Pi, ■■■,Pfe;^i, which 
is the ultimate example we are interested in. 

We will now proceed and define the situation where projections can be trusted to form 
a coordinate system. We will use the same approach as before. We will first define a class of 
points for which such projections are unique in their respective cylindrical neighborhoods. 
After that, we will postulate that this set takes up most of S (and some small parameter 
(J will be used to define the word "most"). Now, we will denote a hypersurface spanned by 
pi, ...,pk to be 



Here, k was used instead of d because, as we repeatedly do Gramm Schmidt process, we will 
use all dimensions between 2 and d] thus, A;(< d) will be used as a dimensional parameter in 
the definition. We would like to say that the direction "perpendicular" to Fi^^s,n{po]Pi-i ■■■,Pk) 
as defined by projection, "commutes" with any direction parallel to F\^s,n{pQ', Pi, ■■■,Pk)- That 
will be defined by saying that if we have two elements outside of that hyperplane, and both 
of them have the same distance to their respective projections, and also if both of them 
are pointing in the same direction as their projections do, then the time shift, radius and 
velocity of these two elements relative to each other almost approximate to respective relative 
coordinates of their projections. More formally, it can be written as follows: 

Definition Let po be an element of S, let 6, e, x, A and A be real number and let n be 
an integer, p is said to be projection compatible up to (A, 5, e, x. A, n) if the following is true: 
Suppose Si and S2 are elements of S satisfying t'p,);A,n(si) < A, t'po;A,n(s2) < A, tp(,(si) < cr, 
tpoi^2) < i^poi^i) < ^) ^po('52) < S. Furthermore, suppose the following conditions are 
met: 

(i) S'l e Psi;n,A52,62,x2,A2(^AAn(Po;Pl, 

(ii) 4 e -Ps2;n,A52,e2,x^A2(-^A,5,n(Po;Pl, 
(in) max{Vs'^;n{si),Vs'^;n{s2)) < S 

(iv) \rs'^{s2) -rs[{si)\ < 5^ 
Then the following will be true: 




(64) 



Vi(|tf|<5) 



a)k4(4) -^.i(s2)| 
^)\ts[{s'2)-tsAs2)\<e^ 
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Definition: Let 5 be a causal set, let S, e, ^, A and a be real number and let n be an 
integer. Let -ffA,5,e,x,A,n be a subset of S consisting of elements that are projection- compatible 
up to (A, S, e, X, A, n). Then 5* is said to be projection compatible up to (A, 5, e, x, A, n, a) if 



4: Fields on a causal set 

Throughout this chapter, the distances are going to be scaled by a small parameter tq, 
so that they no longer have to be integers and the smallest distance is much less than 1. 
This is done in order to make sure that, for small distance scales 6, we have 5^ << 6 which 
would allow us to define approximations. 

4.1 Key advantages of PST-based fields over OST-based ones 

We will now introduce the matter fields on a causal set. As was mentioned earlier, 
one of the main conceptual issues in doing this is the lightcone singularity. Due to the 
fact that we need to define a discretized verion of the "derivative" of a field in order to 
define Lagrangian density at point p, we need to look at nearby points of p. But, since the 
Lorentzian neighborhood is defined by a vicinity of light cone, the point p has infinitely many 
scattered points within its e-neighborhood, and most of them have arbitrary large coordinate 
separation from p. Thus, if we use Lorentzian distance as the only clue in defining discretized 
derivative, that derivative would take into account the values of the field at all the "far away" 
points we just mentioned, which means that the result will have nothing to do with actual 
local value of a derivative of a smooth function, if such existed. I consider this to be the 
major conceptual issue in defining quantum field theory in a causal set background, and 
adressing this issue is the main motivation for the approach proposed in this paper. 

Some attempts to define quantum field theory on a causal set were already made in JT\, 
[2\, [4J and [3j. As far as [T] and [2] goes, scalar field propagators were, in fact, successfully 
defined without any light cone singularities. This is due to the fact that a propogator is a 
function of two points, rather than just one. Suppose we are interested in computting the 
propagator between points p and q, where p -< q. The only part of the future half of the 
lightcone of p that is relevent is the one that is in the past of q. This allows us to get rid of 
the infinite volume we were concerned about. 

Physically, this can be viewed as a consequence of the fact that the line connecting the 
"emission" and "absorption" points (or, equivalently, the line connecting sources and sinks) 
defines a "preferred frame". However, once more than one propagator is introduced, one 
immediately faces the situation that these two "preferred frames" can move arbitrary close 
to speed of light relative to each other. In fact, this phenomenon implies that even if we only 
restrict ourselves to propagators across very small distance, we can still have infinitely many 
propagators since there are infinitely many points in vicinity of light cone to choose from. 
This, of course, is just one of the symptoms that indicates that the light cone singularity is 
back in the picture. 




(65) 
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This is related to a reason why it was much more natural to avoid lightcone singularity 
in [T] and [2] than in j4] and [3j. Since the Lagrangian formalism hides in itself the key 
principles of entire quantum field theory as we know it, it also "hides" the possibility of 
arbitrary many propagators due to virtual particles. This can be seen by the fact that a one- 
element set is an intersection of all possible two-element sets containing that element. Thus, 
Lagrangian density, which is a function of a single point p, is really a function of intersection 
of all possible pairs of points, that cointain p; thus, it includes all possible propagators which 
have p as either emission or absorbtion point. Since the actual Lagrangian was used in j4j 
and [3] as opposed to [1] and [2], the former approach faces the lightcone issues and not the 
latter. This, of course, comes with a price that [1] and [2] fails to represent the real life with 
arbitrary many virtual particles. While, of course, such scenario can be described within [1] 
and [2], its absence is not ruled out, which makes the theory incomplete. 

Now let's look more specifically at what happened in [1] and [3]. In [3], a vector valued 
function f '^(0, x) was defined to be a direction in which (p varies the least in the vicinity of x. 
It was noticed that in case is linear, such direction coincides with a direction of gradient 
of (p. This lead to defining the kinetic term of the Lagrangian as 



This, in fact, naturally restores locality without sacrificing relativity. However, this creates 
another problem: the definition of v{(f),x) is very non-linear, nor can it be seen as a pertur- 
bation to linearity, either. Thus, one can not use that Lagrangian to compute propagators 
or draw Feynman diagrams, since all of these are based on linearity. I belive this to be the 
main reason why one is limitted to numerical methods when it comes to actually solving 
some physical problem on a causal set background. 

On the other hand, in [4J such issue does not exist if (p is assumed to be linear. For any 
given point p, and any unit timelike vector v, the Lagrangian density was defined as 



where r is some small constant and and are dimension (d) -dependent coefficients 
(here, expp{kv) is an element of a smooth manifold one can reach by travelling distance k 
from point p along the geodesic whose tangent at p is a unit vector v). It can be easilly 
shown that, for the case that is linear. 




(66) 



£ = -^{(j){expp{Tv/2)) - (f){expp{-Tv/2))f 




(67) 




(68) 



and 




rf"g(0(r) - 0(p))2 = Cdido<Pf + 5^(9^0)^ (69) 
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Thus, the Lagrangian takes the form 



d-l 



C={Ad- BdCd){do(l>f - BdDdJ2i9k<pf (70) 

k=l 

Now, Cd and are dictated by geometry, while and are up to us to define. So if we 
define them as 

Ad = l + ^; Bd = ^ (71) 
J^d J^d 

we will obtain 

C = (72) 

as desired. 

This, of course, is less than satisfactory. After all, the causal set is relativistic by the 
very setup, so one would expect relativistic covariance to appear naturally, while in the 
above situation if the coefficients Ad and Bd were not adjusted properly we would get non- 
covariant expression, where the non-covariance will manifest itself in f-dependence. On the 
other hand, in |[3j v-dependence is embraced with the explanation that f is a function of 
and thus, overall, everything is covariant. The price, however, is that v depends on in non- 
linear way, which causes a problem. On the other hand, in [1] one gets rid of f-dependence, 
which allows v to be random, which gets rid of non-linearity. In this respect one can claim 
that unnatural way of defining Ad and Bd is the price one can choose to pay in order to 
regain linearity. 

However, it turns out that this price is not sufficient in the case of non-linear 0. In 
light of the fact that r is finite, we can not assume exact linearity in a region of size r. In 
fact, since most of the values of v are arbitrary close to the light cone, the Alexandrov sets 
corresponding to these v "stretch out" arbitrary far coordinate- wise, which results in arbi- 
trary large error in computting Lagrangian density for non-linear 0. In order to address this 
issue, in [1] some specific procedure was introduced through which, for any fixed p, one can 
select f in a way that would minimize the fiuctuations of in the interior of Alexandrov set 
a{expp{—Tv/2), exppirv /2)) (thus, if there is at least one frame in which is approximately 
linear, v will correspond to such a frame). This, of course, implies that f is a function of 
both p and 0, just like it was in |3]. The dependence is just as much non-linear in one case 
as in the other. Thus, we have established that [3] is more "linear" theory than [5] only in 
a special case where is linear. Since path integration takes us outside of that special case, 
sticking to |1] doesn't adress linearity issues, nor does it allow us to do perturbation theory. 

This is merely a symptom of a far more general problem. In light of the non-compactness 
of Lorentz group, the Lorentzian neighborhood has infinitely large volume as well as infinitely 
large projections on coordinate axes. As a result, we were forced to select sub-neighborhood 
which has small volume, and small projection on at least one coordinate system. That 
sub-neighborhood defines a preferred frame, which is designated by a vector which is 
stationary in that frame (in the above cases the correspondence was established through 
exponential map; in general, it doesn't have to be that way, but still there would be some 
kind of correspondence). In order for the choice of not to violate relativity, has to be a 
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function of the behavior of the fields. That function is non-hnear, which brings non-hnearity 
into the theory. 

In the PST-hke case described in this paper both problems can be reversed. Since 
the neigborhood in PST has finite size and finite projections on PST coordinates, both of 
which can be made arbitrary small, we no longer need to select sub-neighborhood. To put 
it another way, a finite PST neighborhood is a sub-neighborhood of infinite OST one, thus 
replacing OST with PST fulfilled our need of selecting sub-neighborhood. As stated in the 
analogy between translational and rotational covariance in the introduction, replacing OST 
with PST essentially allows us to violate relativity legally. In this case, it provides a legal 
way of defining sub-neighborhood. Furthermore, in light of PST, f'^ becomes one of the 
variables, which means that it can no longer be viewed as a (non-linear) function of (p. This, 
of course, removes one and only source of non-linearity of the theory. 

We still retain the claim of [5] that f ^ approximately coincides with the direction of 
gradient of (p. But, since is now a function of and not the other way around, this is 
accomplished by imposing dynamics on (p rather than f^. Thus, the issue of non-linearity 
of f'^ is avoided. An obvious question one might ask is that we might have just traded 
non-linearity of for non-linearity of (f), by imposing non-linear constraint on behavior of 
(f) with respect to rotation of d^cp relative to v^. This can be answered as follows: while it is 
true that imposing a restriction on (p is non-linear, if we expand as a Fourier series. 



then the above restriction simply amounts to either significantly reducing coefficients of the 
harmonics v{k, v) for which k and v have significant enough mismatch or throwing them away 
entirely. At the same time, what we do with each of the harmonics remains linear. Even 
if some harmonics have significantly reduced, or no, impact, we still can draw Feynmann 
diagrams, while remembering v and k dependence of the coefficients that we put before the 
propagator. Nevertheless, as will be shown in the section 4.3, the difference in coefficients 
will cause significant mismatch with well known results of quantum field theory, unless the 
situation is treated with great care. 

4.2 Curvature effects 

The discussion in the previous section was assuming fiat space time. One has to be a 
little more careful when curvature is introduced. If we parallel transport a vector around 
an arbitrarily large loop perpendicular to itself, the vector will be tilted by arbitrary large 
amount once we get to original point. So, if (p does not change as we perform such parallel 
transport, we would be forced to conclude that (p{x,v) = (p{x,w), for all v and w. Thus, (p 
would effectively become a function of x alone, which would bring us right back to where we 
were before we introduced PST. In order to avoid this, we have to allow (p{x, v) to vary a 
little bit in the direction perpendicular to v. We simply have to impose an upper bound on 
that variation, which happens to be a very small number. This still means that we will have 
a problem in case that spacetime curvature is very high. For example, once the curvature is 
high enough, different wave modes can no longer be viewed as independent variables. This 
is related to the issue discussed in [12] where it was shown that due to the global nature of 




(73) 
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the notion of a particle, such notion is not well defined in a curved spacetime. 

In [12] this issue was addressed by hypothesizing asymptotically fiat spacetime. I con- 
sider it unsatisfactory because if the spacetime is asymptotically fiat, the non-fiat region has 
a global "shape" that would break relativistic covariance. In fact, this is more cynical than 
the finite size of the universe postulated through big bang theory. In case of the latter, it was 
suggested by a lot of prominent names in cosmology that we should picturing ourselves to 
live on a four dimensional surface of a baloon that expands in higher dimensions; this means 
that it doesn't have any boundaries within the four dimensions. The issue of assimptotically 
fiat behavior is much harder to treat since it is assumed to be found within our four dimen- 
sional world. It is possible to experiment with taking extra dimensions seriously and trying 
to take advantage of fiatness in the region "outside of the baloon", but the success of this 
endeavor is questionable. Of course, from cosmological point of view it is possible that our 
universe has well defined shape, regardless of what big bang theorists want us to believe. At 
the same time, the halmark of relativistic covariant theory is that it doesn't demand that. 
In fact, as a local theory, it shouldn't demand anything at all on a larger scale. In other 
words, it has to be compatible with infinite universe, whether the universe is actually infinite 
or not. 

One might argue that in light of replacing OST with PST there is a way to somewhat 
remedy the situation. At least in the case of fiat spacetime we can "slice" PST subsets with 
constant velocity coordinates in each slice, and claim that each slice has its own shape so 
that the "prefered direction" determined by the shape of that specific slice coincides with 
the direction of the common velocity of its elements. This, however, does not adress all the 
problems. First of all, as was just mentioned, we were considering global slices of constant 
velocity. This means that we had to assume fiat spacetime in order for that slicing to be 
well defined. Consequently, this does not adress the curved spacetime scenario, which is 
what we were trying to adress in the first place. Secondly, even if we argue that rotational 
covariance is not violated, such boundaries of the universe still violate the translational one. 
Of course, one can argue that the presence of big bang and finite age of the universe violates 
translational covariance as well. But as was stated before, in order for the theory to be 
relativistic, neither big bang nor assimptotic behavior should be demanded by the theory. 

For these reasons, I hope to develop a theory where asimptotic spacetime is only a 
special case. At the same time, we will hypothesize that we live inside of that special case. 
We can then try to propose a theory that is well defined in general case. But this special 
case is the only scenario where the theory has to agree with a traditional quantum field 
theory in a curved spacetime. In particular, I claim that in a general case the presence 
of curvature prevents us from using principles of superposition, so the results can only be 
obtained numerically. But in the special case of assymptically fiat spacetime, we can attempt 
to predict the behavior of particles in assymptotically fiat region. That behavior will include 
"remembering" what happened in the curved spacetime region. The prediction of what 
would be "remembered" a billion years after the fact will be equated with the prediction 
of actual events. My hope is that decoherence phenomenon will help to define mechanism 
of this memory. This, however, is questionable since decoherence is designed to work for 
the case of superposition of harmonics, and curvature, as stated earlier, mixes otherwise 
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separate hormonics of fields propagating on curved background, even tliough curvature is 
assumed to be non- fluctuating (which means that there is no gravitational field as such). 
The exploration of these questions is still up to future research. 

4.3 The dynamics that Umits unwanted degrees of freedom 

In this section, we will take a brief look at different kinds of ways of limiting the variation 
of the field perpendicular to the direction of v^. Let us start by exploring the ways not to 
do it, and why they fail. The first thing one can think of is to add to the Lagrangian the 
derivative term in the direction perpendicular to with a very large coefficient, C: 



^-V'-(.'^)%C=((.^^)^-^|^) (74) 

If we expand as 

(t>{x, v)^ j d'^k^^''(t>{k, v) (75) 

this would result in a very large variation of Lagrangian, whenever the difference between v 
and k is not too small, thus giving us hope that, in such situation, the nearby trajectories 
will cancel each other by interference. The propagator associated with this is 

(76) 



w? + {kvY + c{{kvY - P) 

where kv = k^v^ is a summation over OST indices. On the one hand, the propagator is a 
very small number, unless v and k are very close, as desired. On the other hand, however, 
due to the fact that C is so large, even when v and k are, in fact, very close, the propagator 
might vary wildly. Roughly speaking, it amounts to a particle having substantially larger 
mass, as soon as k and v deviate from each other by very small amount. Furthermore, that 
"mass" variation is continuous. This, of course, contradicts everything we see in a lab, which 
means that this approach does not work. 

The other idea is to multiply C by an imaginary unit i. The motivation for doing that 
is that in the path integral it would give a factor e*^*^^ = e~*^, which is a very small number. 
But, if we write down the propagator, we will get 

(77) 



m? + {kvY + iC(P - {kv f ) 



This propagator mimics all the properties of the previous one: its value is very small unless 
k and p are very close to each other, but, unfortunately, it changes by very large amounts 
when they are. The reason e*^'*^^ = didn't solve anything is that, in the situation when 
k and p are very close, Q-c{k'^-{'^vf) ig no longer very small; thus, its variation due to large 
value of C makes non-neglegeable difference. 

Thus, the key issue in both of the above cases is that we would hke to have it both ways: 
on the one hand, we would like C to be very large so we can force v to be close to /c, and, 
on the other hand, we don't want it to be too large, so that we can use the fact that v and 
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k are close to claim that the difference between the two is unimportant. The problem is that 
formal multiplication by either C or iC doesn't allow us to have it both ways. 

The solution to this dilemma is to restrict the range of integration by hand instead of 
imposing extra terms into Lagrangian. One can argue that, from strictly mathematical point 
of view, we are already used to doing that in standard quantum field theory. In particular, 
we use this concept when we postulate ultraviolet or infrared cutoff in path integration. In 
the situation at hand, we postulate that {d(l)/dx^){x^,v^) is almost parallel to v^. In other 
words, the local space derivative of 0, in a reference frame corresponding to a given element 
of PST, can not be larger than some small constant. In this case we can, in fact, "have it 
both ways" . The restriction, being all-or-nothing by nature, does not have any effect at all 
as long as the value of the derivative is smaller than that constant; at the same time, it has 
the largest possible effect once it gets larger (even by a very small amount) , which amounts 
to getting rid of unwanted terms entirely. 

A causal set version of the restriction of the spacehke derivative of can be defined by 
restricting our range of integration to e J^5,e;n, where Ts,e;n is defined as follows: 

Definition: Let 0: ^ M be a scalar field, e e M+, 5 e M+ and n e N+. Then is an 
element of J-'s,e,n if and only if, whenever tp{q) and Vp.niQ) are both less than 5^, while rp{q) 
is less than S, we always have \(p{q) — < ed. 

In the above definition, the contrast between 5^ being an upper bound on the shifts 
of tp{q) and Vp-n{q), and 6 being an upper bound on the shift of rp{q) indicates that the 
direction of the shift is parallel to the direction of the radius, up to a small angle S (which, 
for pure convenience of saving letters in the alphabet, happened to coincide with a small size 
of a neighborhood, although the two have nothing to do with each other). Now, since we are 
within S neighborhood, the variation of the field is expected to be bounded by something 
first-order-small, or, in other words, something of the order 6. Thus, the fact that it is 
bounded by e5 instead, which is second order small, indicates that the derivative of 4> in the 
direction of the shift is first order small. As we have estabhshed, the direction of the shift 
is parallel to radial direction up to first order variations. Thus, the above definition implies 
that the radial derivative of is first order small as well. This is equivalent to saying that 
the OST projection of the derivative of is almost parallel to t axis in a reference frame of 
PST element of interest. This, in fact, is the ultimate intention of the above definition. 

Finally, we implement the restriction e J'd,e,n by defining the probability amplitude as 
the integral only over such 4>: 



4.4 Kinetic, mass and interaction terms of the Lagrangian 

Now that we spent some time discussing ways of getting rid of unwanted degrees of 
freedom, we will now shift gears and describe the degrees of freedom that we do want. In 
particular, we will analyze the structure of non-interacting as well as interacting terms of 
the Lagrangian and the way it differs from the OST case. 




(78) 
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In light of the fact that the gradient points roughly in v'^ direction, the kinetic term of 
Lagrangian for a scalar field can be written as 

Cmx,v)=[v'^^y (79) 
where /x runs only over OST coordinates. The causal set version of this is 

cmp) = [j2 E (80) 

j=-nqeGj{p) 

On the other hand, in order to allow for the momentum exchange during the interaction 
of two different fields, the continuum version of the Lagrangian density associated with 
interaction of and x is given by 

C{(f)){x,v)= (j){x,v')xix,v") (81) 

J v' ,v" <=NAiv) 

Here, N\{v) is a very large neighborhood of v, whose size is determined by a very large 
number A, which is viewed as ultraviolet cutoff of path integration. As before, the reason 
we can afford having A large is that there is no curvature in velocity coordinates. As in 
ordinary quantum field theory, the conservation of A; is a result of the integration of Fourier 
harmonics that gives unless frequencies match. The conservation of v is the result of the 
constraint v ^ k. This means that if the above coupling violates conservation of v, only the 
second order terms survive; the latter form a vortex which allows for conservation of both k 
and V. 

In the discrete case, due to the fact that e ~ {x,v), e' ~ {x,v') and e" ~ {x,v") are 
completely different elements of a causal set, we can not assume that they share the same 
value of X (which is one reason I just put approximation signs). Thus, x has to be likewise 
replaced with x' and x" in the last two cases, where both x' and x" are constrained to 
some neighborhood, ns{x). However, we would like to say that while Nj>^{v) is very large, 
ns{x) is very small. Since x and v are indistinguishable from each other, there is a single 
neighborhood, A^A,<5(e), which consists of e' whose x-component differs only slightly from 
e, while f-component differs a lot (I decided to use e for an element of causal set in this 
particular section in order not to confuse p with momentum; I am greatful to [15] for giving 
me an idea to use this particular letter): 

A^A,5;n(e) = {e'l max (|te(e')|, |re(e')|) <6A \ve-nie')\ < A} (82) 

Here, n is the constant taken from the section 2.6 which is used to define velocity Ve^niQ)- 
The causal set version of the interaction Lagrangian is 

A2(0,x)(e)= E <l^ie)x{e") (83) 

{e',e"}cAfA,i;„(e) 

This means that, in the approximation where A is assumed to be infinite, the interaction term 
looks like (j){x)x{x) rather than 0(x, f )x(x, f ). However, one still uses 4>{x,v) when it comes 
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to non-interacting kinetic term, which is what makes it possible to introduce contraction 
of d^cj) with v^^. Physically, this means that no momentum exchange occurs during the 
propagation of a particle, while it does occur at vertexes. However, if one chooses to interpret 
momentum exchange in terms of shift in between steps as opposed to coupling between 
different v^, one would find that, in light of the nature of random walk in PST, there is a 
small momentum exchange even in case of free Lagrangian. This can be vaguely interpretted 
as "momentum exchange between the propagating particle and the elements of 5"', and this 
can be viewed as vacuum energy. This picture, however, is very vague since the lifetime of 
each element of is which doesn't allow them to " live" long enough to gain or lose energy. 

Now lets look at a mass term. Despite the fact that algebraically it looks like the 
interaction of with itself, in reality it does not involve any interaction at all, as it is part of 
a free propagator. In case of quantum field theory based on OST, this understanding doesn't 
affect the way we do algebra. In case of PST, however, the fact that no interaction occurs 
has to be explicitly incorporated in that we do not have a coupling (f){x,v)(l){x,w) if is 
too different from v^. Thus, the mass term is given by 

Cynass{4>){x, v) = m^(f)*{x, v)(f){x, v) (84) 

and causal set version of this is simply 

Cmassme)=m^(l)*ie)(l)ie) (85) 

However, self-interaction 0^ term continues to be defined in a way that allows for momentum 
exchange, since that is what we expect when we perform 0^ loop diagrams: 

Cint{(j)){x,v) = X (f){x,Vi)(f){x,V2)(j){x,V3.)(j){x,V4) (86) 

JvieNA{v) 

and the causal set version of this is 

At^(0)(p) = A Yl </>(ei)0(e2)0(e3)0(e4) (87) 



4.5 Electromagnetic field 

We conclude this section by introducing electromagnetic field on a causal set. In order 
to be able to give stationary frame of a photon, we will postulate a small, but non-zero mass 
for an electromagnetic field. The same was done on p. 36 of [8]. 

In case of continuous geometry, we define electromagnetic field to be a function A : M. psr — ^ 
AipsT- The Lagrangian for an electromagnetic field is given by 

C = ^Ff^'F^, (88) 

Similarly to what was done for scalar field, its values are restricted in such a way that A{x, v) 
varies very little in a direction perpendicular to v^. If we define t axis to be the direction of 
f ^, we can get rid of all of the space derivative terms of the Lagrangian, which leads us to 
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Since we already know that a vector is identified with PST element, the first thing that comes 
to mind is to view electromagnetic field as a scalor field on PST. This would also suggest 
a very appealing concept of unification of scalar and gauge field. Unfortunately, however, 
according to the model presented in this paper, PST only consists of timelike vectors, while 
the field A'' is spacelike. This, too, might be something to re-think in the future research: 
after all, if the electromagnetic field was timelike it would have a natural physical meaning, 
namely phase shift of a moving particle that interacts with it, while spacelike electromagnetic 
field lacks that meaning. Unfortunately, at this point I don't see how to make that compatible 
with the wave front picture that I would like to subscribe to in the current paper. For this 
reason, I am forced to stick with an idea of spacelike electromagnetic field as far as this work 
is concerned, which does not allow me to view it as a scalar field on PST; although I still 
intend to come back with these other ideas in future research. 

In light of this, I have two choices. One choice is to consider two PST elements, one 
pointing forward in time and the other pointing backwards in time, and claim that these 
two timelike directions add up to the spacelike one. The other approach is to view the 
elements of PST in terms of their projection onto OST and notice that a pair of spacelike 
separated elements (whether PST or OST) can determine a spacelike direction based on 
the geodesic that connects them. The latter has been an approach in OST-based theories 
of electromagnetic field in [3] and [1]. We notice that, even though the reasons are very 
different, in both cases we are forced to select two PST elements rather than just one. Thus, 
neither of these models would give us the unification of gauge and scalar fields that we desire. 
In light of this, we stick to the view proposed in [3j and [4j simply because it is easier to 
impliment. 

According to [3] and |1] an electromagnetic field was defined as a holonomy, a: S x S ^ 
M, which represents the path integral of A'^ along the geodesic connecting a given pair of 
points. Strictly speaking, in case of a function / : S ^ Ai, the pullback of A^ on Ai is given 



where 'j{f{p), fig)) is a geodesic connecting f{p) with /(g) (see [H]). 

Now, a causal set version of the restriction A^ ^ and d'^A'^ ^ is a G and a G 

(2) 

•^Se-n respectively. We would now like to rigorously define these. In order to enforce locality, 
we restrict ourselves to the neigbhborhood defined by max{\tp{q)\, |rp.„(g)|, |t>p.„(g)|) < 6. In 
order to speak of A^, we have to speak of a{q, s) where q and s are shifted in t direction 
relative to each other. This can be enforced by saying that their r-shift is of the order of 
6"^. This would leave t and v as the primary directions of the shift. Thus, we would have a 
combination of A^ and A^ (we will come back to A^ shortly). Now, in light of the smallness 
of the neighborhood, a{q, s) is expected to be of order 6. So, in order to imply that A^ = 
we have to say, instead, that it is bounded by e6. Now, apart from implying A^ = 0, this will 
also imply A" = 0. I consider this to be an important side benefit. After all, if such weren't 
the case, we would have been forced to view A^ as an element of a tangent bundle to PST 
(or, equivalently, a tangent bundle to a tangent bundle to OST) rather than PST itself (or 
equivalently a tangent bundle to OST). While, technically, such is still the case, due to the 
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unwanted degrees of freedom being nearly 0, A^^ can still be approximated as an element of 
PST. Thus, we obtain the following definition: 

Definition: Let d and e be reals, and let n be an integer. Let a: S x S — )■ M 
be a real valued function. That function is an element of A^H^^ if and only if whenever 

max{\tp{q)\, \rp.n{q)\, \vp-n{q)\) < S and \tp{q) - tp{r)\ < e6, we have \a{q,r)\ < 

(2) 

Now we will move on to the definition of A^^^. This requires a notion of two different 
pairs of elements, (p, q) and (p', q') which define the directions of the holonomy, corresponding 
to two different pairs of points, being "parallel" to each other. In light of the fact that v- 
component of A is nearly zero, these four elements of PST, or, equivalently vectors in OST, 
are viewed as points in OST, and the only relevant direction is the line that connects them. 

However, it is still "easier" to define the parallehsm then it would have been in the case 
of true OST. In light of the fact that we would like to focusc on the space component of the 
vector potential, we would like to assume that p, q, p' and g' share the same hyperplane and 
point in perpendicular direction to that hyperplane. This can be stated more precisely by 
postulating that both their time and velocity coordinates with respect to each other are 0. 

We can now view the imaginary line connecting p and g as a "projection" of a geodesic 
passing through some other vector, r, onto a spacelike plane perpendicular to the common 
direction of p, g, p' and g'. In order to define the projection, we need to draw lines passing 
through p, g, p' and g' that are perpendicular to that common plane. This is easy to do, 
since these lines are nothing but geodesies passing through these four PST elements (which 
are viewed as OST vectors). 

Thus, when we say that p is a projection of an element of a geodesic passing through r, 
we mean that there are vectors p* and r* on geodesies passing through p and r respectively 
such that Tp. (r*) and rr*(p*) are both close to 0. Likewise, p' and g' can be viewed as a 
projection of a geodesic passing through some other vector, r' . 

We can define these two geodesies to be parallel to each other by postulating that their 
velocities with respect to each other are very small. Finally, we can postulate that, as long 
as these geodesies are parallel, the segments are parallel as well. Furthermore, the segments 
have equal length as long as these geodesic segments have equal length, where the length 
can be defined in terms of t-coordinate along each geodesic. 

This brings us to the following definition: 

Definition: Let 8 be real and n be an integer. Let p, g, p' and g' be four elements of S. 
We say that (p, q) is parallel to (p', g') up to ((5, n) if the following conditions are met: 

a) max(|rp(g)|, \rp{p% |rp'(g')|) < 5 

b) max{\tp{q%\tp{pJ%\tp\(i)\)<P 

C) max(\Vp.r,[<i)\, |^p;n(p')l) IV;"!^')!) < ^'^ 

d) There are elements r, r', r\^ r\^ r'^*, r'^^ p*, q*, p'* and g* such that 

(i) {rl,r*} C G{r), {r'*,r'*} C G{r'), p* e G{p), q* e G{q), p'* e G{p'), and q'* e G{q') 
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(ii) max{rp*{rl),rq*{r*),rp'*{r'*),rg'*{r'*)) < 5^ 

(iii) max{tp*{rl),tq*{r^),tp>*{r[*),tq>*{r'^)) < S'^ 

(iv) max{vp.n{p'),Vp>.n{p)) < S'^ 

(v) \tr*ir*2)-tr':ir'*)\<5' 



In the above definition part a, whicli employs 6 rather than d^, doesn't say that anything 
is "small". Instead, it simply assures locality. Parts b and c say that t and t;-components 
are, in fact very close to each other, while r component is not (since 5^ « S), thus implying 
that the vectors share the same hyperplane and all point perpendicular to that hyperplane. 
In part d, (i)-(iii) imply that the each pair of elements is a projection of a pair of elements 
on corresponding geodesic. Part (iv) says that these geodesies are parallel to each other, and 
finally part (v) says that the segments of geodesies whose projections we are looking at have 
equal length, thus implying equal distances between the elements in each pair. 

We are now ready to define Afl^. Basically, the key is to say that if {p, q) is parallel 
to {p', q') up to some order, the difference between a(p, q) and a(p', q') should be very small. 
Now, due to the fact that the size of a neighborhood is 5, if the hue I connecting p and q 
was tilted by arbitrary angle to the line connecting p' and q' , then the difference between 
a{p,q), and a{p',q') would have still been of order 6. Furthermore, again due to the small 
size of 5-neighborhood, even if the space derivative of was non-zero, the continuity would 
still demand that the angle between the two lines is of the order S. This means that the 
difference between holonomies is of the order 5"^. 

Now, in order for us to say that space derivative is, in fact, we should say that the 
difference between holonomies is smaller than expected. In other words, we have to say that 
the difference is of the order of S^. But, as usual, S will be the restriction on a domain, while 
€ on the range of the field. It will be understood, however, that the two small parameters 
are of the same order of magnitude. This means that restriction on variation of a{p, q) is 
eS'^. This brings the following definition: 

Definition: Let a: S x S ^ M.he a real valued function on a set of pairs of elements of 
S. The function a is an element of Ag^^ if and only if whenever (p, q) is parallel to (p', q') 
up to {n, S^), we have \a(p, q) — a(p', q')\ < e5^. 

This defines our domain of integration: 



Now, the only thing that is left is to define S{a). In fight of the previous discussion, the 
only thing we need is d^A'^. This can be simply computed by using a(p, q) — a{p, q), where 
p G Gmip) and q G Gm{q)- If we like, we can average the different values of n. Thus, the 
electromagnetic Lagrangian is given by 




(91) 







(92) 



min(\tp{q)\,\vp-n{q)\)<S'^;\rp{q)\=S peGm(p),qeGm{q) 
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5. Conclusion 

As Sorkin mentioned in [7] , causal sets are highly non-local due to light cone singularity. 
However, in the present work it is shown that the problem of light cone singularities is 
completely solved once the causal set is replaced with PST. Mathematically, this is due to 
the fact that there is nothing illegal in "neighborhood" on an OST violating translational 
invariance; thus, similarly, there is nothing illegal for a neighborhood on a PST to violate 
Lorentz covariance, either. Thus, every element of a PST, which is a vector in OST, can use 
its own direction as a "preferred" one when it comes to cutting off near-lightcone region. 

Physically, light cone singularities that arise in the case of OST are the result of uncer- 
tainty principle, since, if it wasn't for that, we wouldn't have to integrate over all possible 
trajectories. Thus, instead of viewing each element of a set as a localized object with infinite 
position and zero momentum uncertainty, we viewed it as a wave packet where both un- 
certainties are finite. Thus, a position mean, together with momentum mean, of that wave 
packet determines the element of a PST that corresponds to it. 

The notion of well defined direction assigned to each element of PST allows to define 
the geometrical parameters easily and then generalize them to an arbitrary partial ordered 
set. Of course, an arbitrary causal set does not have a geometry, which means that the 
definition of these geometrical quantities is very formal. But, the fact that these quantities 
are defined, allows us to come up with well defined notion of what we mean by a causal set 
being PST-like in terms of local maps. 

Furthermore, in case of a PST, a natural way of defining coordinates arises, without any 
assumption of the presence of continuum geometry. These coordinates allow a natural way 
of defining a "PST-like" causal set. While it could have been possible to use similar mapping 
in defining "manifold-like" causal set in OST case, the light cone singularities would imply 
that a causal set is only manifold-like in some frames and not others. Thus, if we are to 
demand that such frames be found, we would fundamentally set these frames up as "preferred 
frames" compared to others. On the other hand, in PST case it is possible that causal set 
is manifoldlike in every single frame because the velocity is viewed as extra dimension and 
thus the "fast moving fields" while still undergoing the Lorentz contraction, do not "overlap" 
with slow moving ones in terms of PST (while they do overlap in terms of OST). 

Now, of course, local maps can similarly be introduced for more usual, OST-based 
concept of a causal set. But, in light of a light-cone singularities, a causal set can't possibly 
be manifold-like in these fast-moving frames. Thus, one is forced to constrain it to be 
manifold-like in some frames but not others, which violates relativity. 

In case of a the discretization of PST, on the other hand, the notion of a local region is 
already established without violation of relativity. Thus, it is possible to postulate PST-like 
property of every single local region, without postulating any preferred frame. After all, in 
every single frame, the behavior of the vectors that move fast with respect to that frame 
is singular, while the behavior of vectors that move slowly is not. The latter, and not the 
former, is what is used in defining the geometry. 

Finally, the fields and Lagrangians were successfully introduced, in such a way that 
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analytic computations can be done for causal sets with arbitrary many vectors, without 
relying on numeric work. The key of being able to do analytic computations is superposition 
principle, which allows one to expand the field into Fourier harmonics. In previous papers 
([1] and [3]) non-linear procedure was used in finding a frame in which the variation of a 
given field is reasonable. That procedure was introduced in order to get rid of light cone 
singularities. The non-linear nature of the analysis of the behavior of the field is what 
prevented us from using superposition principle. In this paper, this does not need to be done 
since we are told from the start exactly what frame corresponds to what element of a causal 
set; the dynamics is designed in such a way that, as long as we stick to these frames, we are 
guaranteed to avoid singularities. 

At the same time the theory still has some weaknesses. As far as axiomatization of 
Chapter 3 goes, no proof was made to rigourously show that these axioms formally imply 
manifold strucutre. In this sense it is worse than [9] where they did do all the proofs, for 
OST case. However, while [9] was concerned about continuum case, I am concerned about 
discrete case. In light of this it would be several levels harder to make such proofs. While 
this is still a good topic for further research, it is possible that we should just do numeric 
simulations instead (which haven't been done either and thus, too, a topic of future research) 
and just see empirically if a given axiom system is complete enough. 

As far as chapter 4 its main weakness is that I have limitted the range of integration to 
the functions where the direction of OST derivative approximately coincide with a direction 
of vectors corresponding to PST element. While I did show a qualitative argument that 
this procedure would generate Fourier transform, this argument is not explicit enough to 
be satisfactory. Again, I believe the main obstacle from doing more explicit work is the 
difficulty associated with describing random distribution. It is possible that in the future 
work I would have to consider some toy models, such as, for example, different regular 
lattice structures "moving" with different velocities, superimposed on each other, and see 
what happens. Nevertheless it is still interesting to explore how much analytic work can be 
done for the case of random Poisson distribution in light of the fact that at least one of the 
main obstacles, namely non-locality, was taken away. 

Finally, the other weakness of Chapter 4 is the fact that the spirit of a constraint that 
the direction of PST element vaguely coincides with the OST derivative of the field is a vague 
equivalent of the notion of "particle". After all, a particle of fixed momentum, which is a 
wave, can be described in terms of that constraint. However, [12] shows that particles can 
not be objectively defined in a curved spacetime. Equivalently, a parallel transport around 
the loop of a vector does not bring it back to the original one. I have dealt with this issue 
by emposing an upper bound on curvature which then allowed me to say that the deviation 
from that constraint is "small but finite" , and that number is a function of curvature. Again, 
however, this is rather vague and I would like to see more explicitly how it works out in case 
of curved spacetime, in future research. 
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